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“ He who knows mathematics and does not make use of his know¬ 
ledge, to him applies the verse in Isaiah (c. s, v. 12), ‘They regard 
not the work of the l>ord. neither consider the operation of His 

hands/ 


'T'ut? 'Pai nfifn 


“The laws by which God has thought good to govern the Universe 
are surely subjects of lofty contemplation; and the study of that 
symbolical language by which alone these laws can be fully 
deciphered is well deserving of his [man's] noblest efforts.” 

Professor Sedgwick 


“ The living and the dead, things animate and inanimate, we 
dwellers in this world and this world wherein we dwell, are bound 
alike by physical and mathematical law.” 

D'Arcy W. Thompson 

“ The ultimate aim of embryology is the mathematical derivation 
of the adult from the distribution of growth in the germ.” 

Wilhelm His 




PREFACE TO THE THIRD EDITION 


Thirty years ago Biomathematics first appeared. The idea of applying 
mathematics to biological problems must then have seemed rather novel 
to most biologists, and perhaps a little strange. It is true that there had 
already been brilliant pioneering work, such as that of Sir Francis Gal¬ 
lon and Professor Karl Pearson, who had stressed the importance of 
exact measurement and had introduced many valuable statistical ideas; 
but the field was new, and waiting to be cultivated. 

To-day the situation is very different, especially in the widespread use 
of statistics—due in no small measure to the discoveries and the influ¬ 
ence of Sir Ronald Fisher. It is clear enough that many problems can 
only be treated statistically. When we suppose that Welshmen are 
shorter than Englishmen, or that tuberculosis is more prevalent in 
Africa than in Western Europe, we do not imagine that every Welshman 
is smaller than every Englishman, or that every African will catch T.B. 
and every Englishman remain free; rather we mean that these assertions 
are true on the average, and their truth or falsity can only be tested by 
carefully selecting samples from the populations concerned. Many bio¬ 
logical questions are of this type. 

But besides the use of statistics in interpreting experimental or obser¬ 
vational results, there have been growing up other streams of thought, 
leading to a quantitative explanation of biological phenomena. A notable 
example is the work of Professor J. B. S. Haldane, Professor Sewall 
Wright and Sir Ronald Fisher on the mathematical theory of popu¬ 
lations, natural selection, and evolution, which has been further developed 
by many other workers. Besides that, there are the mathematical prob¬ 
lems concerned in the chemical and physical properties of living crea¬ 
tures. Much still remains to be done. 

'Phis book is an attempt to present the fundamentals of mathematics 
to the biological or medical student and research worker. It is not a text¬ 
book on statistics (although two chapters in simple statistical topics are 
included); there is already a large literature on that subject from which 
the student can make his own choice. 

It seems to me that there arc several ways in which an explanation of 
mathematical processes can be helpful to a biologist. It can firstly serve 
as an introduction to the subject. 'I’his was evidently the intention of 
Dr. Feldman when he originally wrote this book, and in revising it I 
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have tried to keep to his general plan. New chapters have been added to 
explain new tools, such as matrices, which are now recognized as valu¬ 
able aids to computation. However, even mathematically educated 
research workers may from time to time meet problems requiring more 
advanced techniques: and so, as an experiment, a (somewhat con¬ 
densed) explanation of some of these techniques has been given in 
various places in the text, or references have been provided for further 
reading. It is hoped that this will be helpful in dealing with special 
problems as they arise. But these passages are not necessarily intended 
to be understood at a first reading or to discourage the student, but 
rather for later consultation as required. (The reader who finds them 
too difficult can omit them without serious loss.) I have tried to give a 
specially careful explanation of subjects for which I do not know of 
any adequate or easily accessible account elsewhere, and this has, in 
places, given these a prominence which they might not otherwise 
deser\^e. 


Another justification for a book of this kind is the study of statistical 
theory. The user of statistics will certainly be helped by some knowledge 
of the theory underlying the arithmetical processes he is using, and such 
a knowledge (always combined with a good dose of common sense) will 
help to avoid errors. But most books on the theory assume a certain 
mathematical background, and Diomatheniatics should help to supply 
that. Simple algebra, geometry and calculus will go a very long way 
towards such understanding (although the subject is growing so rapidly 
that even professional statisticians find it difficult to keep up with the 
latest de^■clopments). In some cases a process has been introduced with 
a view to its application to more advanced statistical theory. Unfortun¬ 
ately it is not always eas)’ to find simple concrete examples to illustrate 
such techniques without going beyond the limits of the book, but it is 
hoped that the explanations will not seem too difficult. 


I'inally 1 believe that this book will also appeal to mathematicians, 
both amateur and protessional, who are curious about the development 
of their own subject and interested to see its applications in the relatively 
unfamiliar field of biology. Certain methods of presentation may inter¬ 
est this type of reader. Tor instance, the definition of logarithms in 
terms of a spiral seems to lead very directly to their chief properties, 
especially that ot differentiation: this approach was used by Clifford, 
but has apparently been forgotten. The notation “ Dy ” for a derivative 
has been preferred as a rule to •• Jy J, experience both mathe¬ 

maticians .and biologists find It less confusing and logically more satis- 
actorja Also the older notation for a factorial has been used; this seems 
ery much clearer to the eye, especially in manuscript, than the newer 
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form “ ii \ ” and I am grateful to the publishers for their collaboration 
on this point of detail. 

Some of the characters appearing in this book (including Ilerlock 
Soames, Prof. Moronami, the Ngboglus, the Society Hostess, and the 
Universities of Camburgh and Kdinford) are evidently intended to be 
entirely fictional. Should they coincide in name or description with any 
real person or institution this is entirely fortuitous, and apologies are 
herewith tendered. 

Apologies are due should, despite careful checking, any acknowledge¬ 
ments have been inadvertently omitted or wrongly made, and any 
notice of such an error will be gratefully received. 

It is indeed a pleasure to record the very considerable assistance I 
have received from my friends and colleagues in the preparation of this 
revised edition—though it must of course be added that any faults are 
my own and not to be attributed to them. Dr. Vivian Feldman (son of 
the original author) has been kind enough to read through the greater 
part of the proofs, and has made very valuable suggestions. Dr. C. C. 
Spicer, Dr. R. S. Krooth, Dr. H. Griineberg and Miss B. E. Simpson 
have also kindly read parts of the manuscript; and helpful comments 
and encouragement have been received from Professor J. B. S. 1 laldane. 
Professor U. S. Penrose, Professor Lancelot Hogben, Mr. A. R. Par- 
geter. Dr. S. B. Holt and others, and some most useful suggestions from 
the publishers. 'Fo all of these I wish to tender my thanks. 'I'hanks are 
also due to Professor Sir Ronald Fisher, Dr. F. Yates, and Messrs. 
Oliver & Boyd for permission to use tables from the books Statistical 
Methods for Research Workers (Fisher) and Statistical Tables for Bio¬ 
logical, Agricultural and Medical Research (Fisher and Yates), to Pro¬ 
fessor M. G. Kendall and Messrs. Charles Griffin for the use of diagrams 
and tables from An Introduction to the Theory of Statistics, to Dr. 
W. L. M. Perry (of the National Institute for Medical Research) for 
information on International Biological Standards, to Professor L. S. 
Penrose for the use of Fig. 21.4, taken from the Annals of Eugenics, to 
Dr. H. Griineberg for his linkage data on mice, and to Mr. M. C. K. 
Tweedie for allowing me to describe his estimation theory (as yet only 
partly published). It is only fair to add congratulations to the pub¬ 
lishers and printers, who have turned a somewhat untidy manuscript 
into a well arranged and clearly presented text. 


University Colli:ge 

l.ONDON, W.C.i 
December, 1953 


CEDRIC A. B. SMITH 
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CHAPTER ONE 


INTRODUCTORY 


1.1 Mathematics and biology 

Pure Mathematics, as far as one can exactly define it, is the science 
of number and shape: or, as one may say, Arithmetic and Geometry 
in the most general sense of these words. Recently, too, pure mathe¬ 
maticians have taken a great interest in the more abstract ideas of 
classes, properties, relations, and so forth; or, as we may say, in Logic. 
Applied Mathematics consists simply in the application of the dis¬ 
coveries of Pure Mathematics to obser\'ed phenomena; and since we 
already have the sciences of “Biophysics” and “Biochemistry”, we may 
perhaps speak of “Biomathematics”, or “Mathematical Biology”, 
meaning the application of mathematical methods to Biology. This 
will accordingly cover a wide range of subjects: in fact, whenever we 
count or measure we are using mathematical ideas of the simplest 
kind; when we draw a graph to represent a set of observations, calculate 
the rate of a biochemical reaction, or try to find the chances of passing 
on a hereditary disease, we are using more complicated ideiis, but ones 
which are based in a straightforward way on the simpler ideas of 
measuring and counting. It is the purpose of this book to explain 
some of the more complicated mathematical ideas of interest to biologists, 
and to show how to use them, and how they arise fairly naturally from the 
simpler ones. It would be dishonest to claim that it is possible to make 
all ideas and methods entirely painless to the reader W'ho is completely 
new to the subject, but an endeavour will be made to smooth away 
the difficulties, and as far as possible to give biological examples of the 
use of each process as it is introduced. 

We may summarize the growth of scientific knowledge somewhat 
in this fashion. As a rule, an experimenter will have some hypothesis 
or hypotheses at the back of his mind before making any experiments. 
These hypotheses may be quite precise, or sometimes they may he 
extremely vague and tentative—but usually, only when his field of 
research is entirely new and uncharted will he explore in a random 
and undirected manner. Naturally he will be most interested to know 
how these hypotheses can be related to observable quantities, and to 
know how he can best plan his experiments to test them. If counting 
or measurement is involved, it will usually be convenient to express 
the hypotheses by mathematical formulas. Strictly speaking, these 
really amount to no more than a form of shorthand: thus the equation 
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X y — z States no more than the sentence “if we add the number 
'x' to the number ‘j’ we obtain the number ‘jsr* All such equations 
could be translated into ordinary language, though sometimes very 
clumsily and lengthily. But this form of shorthand is extremely con¬ 
venient in the next stage of the process, in which we work out the 
consequences of our hypotheses. Here we can play about with the 
formulas according to standard rules, without having to bother ourselves 
too much with what the formulas mean in practice. This formal 
manipulation means that we can deduce various consequences without 
the labour and tedium of a considerable amount of heavy reasoning. 
When in this way we have discovered what results may be expected 
to follow from our hypotheses, we shall be able to see what are the 
most useful experiments we can perform, and what sort of calculations 
we must make when we have obtained our experimental results. 
Finally, if these turn out to agree with our hypotheses, we can pro¬ 
visionally accept them as proved, and from them predict further 
consequences which may not for the time being be verifiable by obser\'a- 
tion. 


For example, when Mendel began his famous experiments on 
heredity, he must have had the idea that interesting results could be 
obtained by counting the numbers of different kinds of offspring 
obtained from two given parents. Further investigation showed that, 
for example, if we self-fertilize a pea plant which is obtained from a 
cross between a strain having yellow seeds and one having green seeds, 
then in the next generation we find approximately 3 times more yellow 
seeds than green ones. If we now assume that in the long run we get 
exactly 3 times as many yellows as greens, the theor}' of probability 
tells us that in an experiment in which looo offspring are obtained, it 
is most likely that the number of yellows will be between 714 and 
786: in tact, that will be true in 99 out of every 100 such experiments. 
In this way we can verify that Mendel’s laws do in fact hold to within 
the accuracy of our measurements. And once we know that Mendel’s 

laws arc true, we can predict the effect of many generations of inbreeding 
on our plants. 


1.2 Plan of this book 

We begin our subject by recalling some simple arithmetical facts: 
they occupy this chapter and the nc.xt. Various useful calculating 
devices and methods of computation are considered, both with paper 
and by the use of various mechanical devices, such as desk calculators 
punched card equipment, and so on. Graphical methods of calculation 
such as slide-rules and nomograms are explained rather later, namely 
m Chapter 7. C hapter 3 sets out in a systematic way the fundamental 
rule,, of algebra and will be useful for purposes of revision. Elementary 
algebra as explained in most books deals largely with equations, or 
equalities, but in practice wc often require to know of two quantities 
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which is the larger one and which the smaller; hence it is important 
to discuss “inequalities” or the comparisons of magnitudes. I’his is 
done rather briefly in Chapter 4. In Chapter 5 we see how algebra 
and geometry are bound up with one another, so that many problems 
in algebra can be represented graphically, and many geometric prob¬ 
lems can be solved algebraically. 'I'his leads on naturally to the study 
of logarithms in Chapter 6; these are numbers which are not only 
useful in assisting with arithmetical calculations but also have many 
other applications to the study of rates of growth, chemical reactions, 
and other phenomena, and in addition have a geometric interpretation. 
In Chapter 7 these logarithms are applied to slide-rules, nomograms, 
and similar calculating devices. In Chapter 8 we begin the study of 
rates of change, or “differential calculus”; this can be applied to all 
kinds of problems in which changes arc taking place, e.g. growth, 
chemical reactions, and motion, as well as to finding what are 
the maximum and minimum values of a variable quantity. When 
looked at in reverse this becomes “integral calculus”, and can 
rather unexpectedly be used for such problems as finding lengths, 
areas, volumes, work, energy, centres of gravity and moments of 
inertia. 

In Chapter 13 we go on to the study of series. These sometimes 
occur naturally in nature, as when an organism such as a bacterium 
reproduces itself with “geometric” rapidity, the whole population 
doubling itself in each generation. But for the most part series are 
mathematical devices which help us to solve otherwise intractable 
problems. We continue by introducing yet another useful device, tlic 
“square root of minus one”: this turns out to be very much less mys¬ 
terious than it sounds. Chapter 16 is a recapitulation of some of the 
more important facts of physics and chemistry, together with a dis¬ 
cussion of the appropriate mathematical methods. In Chapter 17 we 
go on to consider the various ways of solving equations. We find that 
any equation can be solved, at least by numerical methods. 'I'his leads 
on to the study of “matrices” which are nothing more than collections 
of numbers. 'I'hey have been used increasingly often in biological 
calculations in the last 20 years, both in statistical work and in the 
theory of artificial selection. 

In Chapter 19 we discuss the “ laws of chance” which are so 
important for geneticists. 'I'hcse laws are applied in Chapter 20 to 
the study of natural variation, including the discussion of averages, 
range or spread, and the correlation between different characters. 'I'his 
is followed in Chapter 21 by a review of a few useful and important 
statistical procedures. (Space does not permit of more than a very 
brief introduction to this immense field.) 

The book ends with an account of an arithmetical notation by means 
of which calculations can be greatly simplified in many branches 
of Mathematics. 'I’his notation is actually 200 years old, having been 
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(apparently) first discovered by John Colson, F.R.S. But it has been 
strangely neglected. 

At all points an endeavour has been made to keep the argument in 
as simple a form as possible. But the problems under discussion are 
of very different degrees of difficulty. And some fairly difficult sections 
have been added in places where the author has felt it appropriate to 
explain some particularly tricky point in greater detail and with greater 
rigour. These sections may be omitted at a first reading with con¬ 
siderable advantage. Other sections deal with special devices used 
in solving particular problems; they are again not needed for a general 
understanding of the subject, but may be useful for reference when 
required. Thus the reader may well wish to pass over the latter parts 
of Chapters 3 and 5 (say from Sections 3.8 and 5.10 onward) as these 
deal with more specialized topics, and are written rather concisely, 
whereas in Chapters 6, 7, and 8 there are discussions of logarithms, 
nomograms and the calculus, subjects of much more general applica¬ 
bility. However, these harder sections have not been specially marked 
in any way: it has been felt better to lea\*e the reader quite free to make 
his own choice according to his individual preference. 


1.3 Accuracy 

Pure mathematics is often looked upon as being a perfectly exact 
science, the physical sciences as being next, and the biological sciences 
still less so. In a limited sense this is true: when we are dealing with 
numbers in their own right we have no experimental error to contend 
with, and our formulas will often be exactly true. On the other hand 
many biological measurements will be limited to 2- or 3-figure accuracy. 
Even when greater accuracy would be possible, there would often not 
be much point in it. For example, there would be no useful purpose 
served in measuring the weights of the inhabitants of a town to the 
nearest milligram, for all these weights would be substantially altered 
within a few hours. Thus thi. accuracy of our results is limited by the 
inaccuracy of our mcasuremer !j. 

But perhaps it is worth pointing out that accuracy of reasoning is 
just as desirable in biology as in mathematics, physics, or any other 
science. \Ve may take two examples to illustrate this, one being 
biological, and one mathematical. It is known that in certain species 
of cave fish the eyes are degenerate. \Vc might explain this as follows; 
since the fish live in practically complete darkness, they have no use 
for their eyes, and so do not exercise them; and since they are now 
atrophied, it seems plausible that in the course of thousands of vears 
the eyes have degenerated through lack of use. Following this line of 
argument, we might even be able to construct a quantitative theory 
which would explain the observed facts to a high degree of accuracy. 
Nevertheless no such eflect has been obseived in animals whose heredity 
has been studied experimentally, and there seems even' reason to 
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believe that the explanation is wrong. A more plausible theory would 
be that mutations tend to occur which damage the eyes; since the eyes 
are no longer used, natural selection does not eliminate the fish with 
defective eyes, and they pass on their defects to future generations. 
Which theory is accepted may not matter very much when we are only 
considering the particular case of these cave fish, but clearly it will 
be very important when we come to examine the problem of evolution 
in general. In the same way, an apparently slight error in a mathe¬ 
matical argument may lead to completely wrong conclusions. A famous 
example is the “proof” that i = 2. Let us suppose that .v — i, and 
y= I, so that x = y. Multiplying both sides of this equation bv .v, 
we obtain .v^ = xy, and on subtraction of y-, x- — y- = xy — y-. 'I'his 
is the same as saying that (.v + _)’)(.v — y) = y (.v — y), and on dividing 
by the common factor (.v —y) we obtain .v -by — y, or since .v -- 1. 
y = I, this proves that 2=1. What has gone wrong } If we put in 
the values x — i, y — i, we see that all our equations are correct as 
far as the point where we asserted that (.v -b y)(.v —y) — y (v — y); 
we have broken the rules by dividiitg through by the common factor 
X —y, which is zero. Division by zero never gives a definite answer. 
When we say that 10/2 = 5, we mean that 5 is the number which 
when multiplied by 2 gives the product 10. So o o ought to mean the 
number which gives the product o on multiplication by o. but that is 
true for any number whatever. In other words, o divides into o exactly 
cither once, twice, or as many times as we like. On the other hand, 
i/o would be the number which when multiplied by o gives the pro¬ 
duct I. 'I’his is impossible: no multiple of zero is ever equal to i. 
'rhtis division by zero is an unfruitful operation, and it is customary 
among mathematicians to ban it entirely. 'I’he punishment for a breach 
of this law' may be an absurd answer to a calculation. 'The correct 
procedure in the above case would be to say, “We kin)w that (.c | v) 

(jc — y) = }’ —>')• "1 here are now two possibilities, either ,v v o, 
or else a: — y is not equal to o [in symbols, .v —y o], and we are 
allowed to divide by (.v — y), obtaining x y - - y. In fact we know that 
in our case it is the first possibility, x—y o, which is the correct 
one.” Since Ihomathematics rests, as it were, on the twin foundation 
stones of Biology and Mathematics, it is necessary to make sure that 
its foundations are secure on both sides, it is of no use building an 
elaborate mathematical calculus on an inadecjuate theory; and equallv 
well the most perfect biological insight will be largely nullified by faulty 
calculation. Unfortunately in bi(>logical matters only experience can 
be a safe guide: but mathematically the pitfalls are for the most part 
relatively few in number, and well-defined, such as the inadvisabilitv 
of division by zero, and they will be pointed out when they occur. 


1.4 Experimental error 

'[’here are several kinds of inaccuracy affecting calculations. Besides 
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the possibility of inexactness of reasoning—ahvays to be avoided if 
possible—we have already mentioned the inaccuracy of measurements. 
Now there are some measurements which we can make practically as 
accurately as we wish, with sufficient trouble. Thus if we are measuring 
the height or weight of an animal, ordinary scales and balances are 
frequently good enough to give us the answer to an accuracy well 
beyond anj'thing we require (though of course for special investiga¬ 
tions this may not be true). In such a case it is usually good enough 
to specify the measurement only to a certain limited accuracy, and also 
to state the maximum error. Thus if a man’s height is stated to be 
i8i centimetres, or i-8i metres, measured to the nearest centimetre, 
we mean that the error of measurement does not exceed ^ centimetre: 
his height lies somewhere between 180*5 181-5 centimetres. We 

then say that the measurement is correct to 3 significant figures. (“Signi¬ 
ficant figures” are those whose value is definitely kno\^'n, but excluding 
zeros at the left-hand end of the number. Thus 181, i-8i and *000181 


all contain 3 significant figures.) In practice, this strict definition 
occasionally gives trouble. If the original measurement turned out to 
be 180I centimetres, measured as carefully as possible, then it could 
be the case that the true measurement is really slightly smaller than 
this, giving 180 cm. to 3 significant figures, or slightly greater, giving 
iSi cm. 'rhus it is better to consider a measurement as correct to 3 
significant figures, if the error is not greater than about 5 units in the 
next place. Now, generally speaking, if each of our measurements is 
correct to (say) 3 significant figures, then the result of any calculation 
we do on them will also be correct to about 3 figures. However, this 
is not i]uitc true, as there will be a slight increase in the error at each 


step in the calculations. 'Thus, if we multiply 181 by 2'S3, we obtain 

Ihit it these numbers arc correct to 3 figures only, “181” 
may stand for any mimb''r between about 180*5 ^^^d about 181-5, and 
“'2-53” may stand for any number between 2-525 and 2-535. The 
jmoduct lie;, between 180-5 •; 2-525 455*7625 and 181-5 2-535 = 

4bo 1025; rongldy speaking, between 456 and 460. So the value 
4 .> 7’63 heurer necessarily be correct to 3 figures: the error may 

be as lurgi: as 2 ifi liie third iigurc. .Xgain if we add 181 H- 253 4- 
102 'vhere all ihc numbers liavc three-figure accuracy, the real 

1 t.d iiiav Iw -.IS small as iSo-5 4- 252-5 - 101-5 -4 375-5 ■ 910, or as 

im at as i.Sf 5 : 253-5 102-5 o' 376*:" - 914. Krrors always tend to 

accuinuMtc in diis manner; but although this tendency cannot be 
etunth. a\*e'iUi.i, t can be kej t down to a minimum by using one or 
iv,(- more h'2urc.s in the calculations than are needed in'the final result 
fin eAr.ptional cases several rm.re figures may be needed). Fortunately 
;ds., the errors introduced in tlu- diflcrent steps of the calculation will 
IrequeTUly carscel one another co a large e*slent, so that the error in the 
linal answer will rarelv he \ e» v lui 't. 


The n,;n.hcT of "siKnifioint figures” m;iy be considered as a rough 
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way of stating the proportional error. If a measurement comes to 
100*1, correct to 4 figures, we know that its error is not more than 
about -05, or i part in 2000: but if it is 999 9, then the same error, 
•05, amounts to only i part in 20,000. Thus a number is correct to 
4 figures when the error lies between i part in 2000 and i in 20,000, 
five-figure accuracy corresponds to an error between i in 20,000 and 
I in 200,000, and so on. 

A point worth noting is an inconsistency which sometimes occurs 
in the usual way of writing very large numbers, as, for example, the 
population of a large country. According to the 1931 census, the 
population of the United Kingdom was estimated as 46,038,357, or 
46 millions, to the nearest million. Now when this figure is considered 
to the nearest million, it is usually written 46,000,000; the zeros here 
do not actually stand for zero itself, but rather for digits whose exact 
value does not matter much, or is unknown. Such digits are of course 
not significant figures. If now we consider the population of England 
and Wales, 39,952,377, to the nearest million, this must be written 
as 40,000,000. Ilere, however, the first zero is significant—the popu¬ 
lation is 40 million, and not 41 or 39 million: the other zeros are just 
put in to show that we arc counting in millions. Hut there is nothing 
in the way the number is written to show that distinction, d'here are 
two ways in which we can overcome this difficulty. Firstly, we can 
write 40,000,000 as 40 X lo'"', thus showing clearly that we are counting 
in millions. Alternatively, we can invent a symbol, such as ?, or *, 
or 3 to stand for a digit whose value is unknown or unimportant; the 
number 46 million could be written 46,333,003 and the number 40 
million as 40,333,333, thus making a clear distinction between significant 
and non-significant figures. Counted to the nearest hundred-thousand, 
the population of England and Wales would be 40,033,333. 

So far we have been dealing with cases in which the error is so 
small, in comparison with the quantity to be measured, that it is enough 
simply to state the number of significant figures correct in the calcula¬ 
tions. 'I'he opposite case is that in which the error is of the same order 
of magnitude as the effect to be measured. For example, if we are 
trying to compare the yields of two varieties of wheat, we can easily 
find that the differences of fertility, even between different parts of 
the same field, are greater than the difference we are tryung to measure. 
When this is so, special statistical methods have to be applied in 
order both to reduce the error as much as possible and also to make 
a careful estimate of its magnitude. We shall discuss these points 
later. 

Finally we may notice one other frequent form of inaccuracy. Often 
it is not possible to take into account all relevant factors, either because 
we do not know all of them exactly, or because they wcjuld make the 
calculations too complicated. Instead we use a deliberately simplified 
model of the situation, knowing that we are introducing some error, 
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but hoping that our model will reproduce the essential features of the 
situation. There is of course little harm in this sort of idealization^ so 
long as one is aware that it is a deliberate simplification and has its 
limitations. 
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2.1 Simple counting 

Arithmetic began with the discovery that it was possible to count 
groups of objects. Nowadays, when we have machines which will do 
thousands of multiplications and additions per second, and carry out 
a whole complicated programme of computation, we usually take such 
matters as simple counting entirely for granted. All the same, if the 
universe was constructed in such a way that when we counted a set of 
objects we sometimes got the answer 2, sometimes 4, and sometimes 5, 
then not only mathematics but also a great part of science as we know 
it simply would not exist. No doubt the discovery of counting was 
made by degrees, and not suddenly: but we could imagine it happening 
in some such way as this: 

Jacob: Father, you know how difficult it is to make sure you’ve got 
all your sheep in the pen. 

Isaac: Yes, son. My memory’s pretty bad these days—and there 
are so many names to remember: Deborah, and Mary, and Rachel, 
and Beth, and ... 

Jacob: Well, I’ve found out an easier way. 

Isaac: Tell me, my son. 

Jacob: When the sheep walk into the fold I call them in turn “one, 
two, three, four, five. ...” 

Isaac: Where did you get these outrageous words, Jacob? 

Jacob: 'Vhey came to me in a dream, father. When the last sheep 
has gone in, J should have got to “fifty-seven”. If I haven’t, I know 
there’s something wrong. 

Isaac: Well, you can say what you like, son, but I like to stick to 
old, well-tried methods, with none of this new-fangled gibberish like 
‘‘one, two, three”. And what will you do when the sheep have lambs? 

Jacob: I haven’t thought that one out yet, father. But my method 
has always worked perfectly so far. 

After the discovery of counting itself, the processes of addition, 
multiplication, subtraction, and division must each have been in their 
time first-class discoveries — though now they arc only too painfully 
familiar to every schoolboy. 

By counting groups of objects in this way we obtain the numbers 

•) 
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zero, o (when there is nothing there) and the “positive integers** i, 2, 
3, 4. . . . These have many fascinating properties, but the most 
important one from the practical standpoint is that of “unique factor¬ 
ization’*. 


2.2 Factorization of numbers 

A number greater than i which is exactly divisible only by itself 
and I is called a “prime number”: for example, 2, 3, 5, 7, ii and 13 
are primes. Other numbers (greater than i) are “composite” and can 
therefore by definition be split up into factors, which are either prime, 
or else can themselves be split into smaller factors, and so on, until 
finally we must end up by resolving the number into factors which are 
all prime. Thus 60 = 2x30 = 2x5x6 = 2x5x2x3 or alter¬ 
natively 6o = 3X20 = 3X2X io = 3X2X5X2or6o=iox6 
= 5X2X3X2. Now it is a remarkable fact that in whatever way 
we split up any number, we always arrive at the same prime factors 
in the end. Thus 60 decomposes into two 2’s, 3, and 5, and although 
we may obtain these in different orders according to the different ways 
of doing the decomposition, we could never obtain factors like 3X3X7. 
This is not an obvious fact (at least not to the writer) and a proof is 
given later. (Just to show that there are more difficult problems we 
mention Goldbach’s conjecture, that every even number except 2 can 
be expressed as the sum of two prime numbers, for example, 4=2-{-2, 

b =3 + 3 > S — 3 + S> = 3 "b 7 * = 5 + 7 » ^^^id so on. Although 

this conjecture sounds extremely simple, and no even number is kno^^'n 
for which it is not true, nevertheless to supply a formal proof has 
so far baffled the ingenuity of the best mathematicians in the world.) 
As it is often very useful to be able to split a number up into its 
factors, we give below' the standard tests for the most frequent prime 
factors. 


A number is divisible by 2 if the last figure is even (o, 2, 4, 6, 8). 
It is divisible by 2^ = 4 if the number formed by the last two figures 
is divisible by 4 (e.g. 312 is divisible by 4 because 12 is), and by 2=^ = 8 
if the last three figures are divisible by 8, that is, if the last two figures 
are divisible by S and the preceding figure is even (e.g. 1216) or if the 

last two figures are divisible by 4, but not by 8, and the preceding 
figure IS odd (e.g. 1312). 

A number is divisible by 3 if the sum of all its figures is divisible 
■*y 3 J3^2, because i -f 3 6 -f 2 --- 12 is divisible by 3): and 

by 2“ y if the sum of all its figures is divisible by 9* 

A number is divisible by 5 if it ends in o or 5, by 52 = 2«: if it ends 

in 00 25, 50 or 75, and by 5*'* 125 if it ends in 000, 126;, 2SO 37c 

500, 625, 750, or 875. ‘ ^ «.>/:>» 

A number is divisible by i r if the total obtained by alternately 
adding and subtracting consecutive figures is divisible bv ii. For 
example, 8481 is divisible by 11, since 8 — 4 -f S — i ^'i i 
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Divisibility by 7 is best tested by straightfonvard division. Although 
there are dodges which enable the work to be slightly shortened, they 
scarcely seem worth the trouble of remembering. It is also rarely worth 
while testing for prime factors above ii. 


2.3 Fractions, negative and irrational numbers 

These tests for divisibility are useful in dealing with fractions. It 
must have been discovered quite quickly that numbers could be applied 
not only to counting, but also to measuring, as when we talk of a weight 
of 2 kilograms or a length of 3 metres. It must also have been soon 
discovered that ordinary integers or whole numbers are not sufficient 
for expressing all measurements, and a number of new kinds of num¬ 
ber” have had to be invented for the purpose. The simplest of these 
new types is the fraction: as ^ kilogram, 17 100 (or -17) metre. We 
assume that the reader is well acquainted with the properties of 
fractions; briefly summarized they are: (<a) no fraction has zero denomi¬ 
nator; (i) a fraction is unaltered in value by multiplying or di\iding 
both numerator and denominator by the same number (otlier than o), 

as ? = — • (c) to multiply fractions, multiply their numerators to 

5 15 

obtain the new numerator, and their denominators to obtain the new 


denominator, as ^ X {ti) to divide by a fraction, exchange 

3 7 21 

numerator and denominator of the di\isor and multiply, as 

X ^ • (e) fractions with the same denominator ntay be added 

3 7 2 »’ 26 

or subtracted by adding or subtracting their numerators, as ^ -f- ^ 

4 _ 2 + 6 4 _ 4, /yx Q Q (provided y is not o). 

5 "S 5 ’ 

All other properties of fractions follow from these six rules. 'I'he most 
difficult operation is that of reducing fractions t() a common denomina¬ 
tor for the purpose of addition or subtraction. 1 his can be most simply 
done by factorizing the denominators, as far as is convenient. I hus 



d- 


I 

12 


factorize 4b = 2.23 ( 2 X 23), 69 — 3 - 23 » 


to add 4- / 

46 69 

12 = 22.3. [A low dot, as in 2.23. denotes multiplication in European 
texts, and is not to be confused with the decimal point, as in 2-23, 
which is Britain is written high. In America the convention is reversed, 
the decimal point being low and the multiplication point high. In 
many European countries a comma is used for the decimal pomt, as 
2,23,] Now since 12 contains as a factor, so must any multiple of 
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12; and, in particular, so must the lowest common multiple (L.C.M.) 
of 46, 69, and 12. The L.C.M. must also contain 3 and 23, for a similar 

7 7 

= —— as a 

46 2.23 

fraction with this denominator, we must multiply both numerator and 
denominator by 2.3 = 6, i.e. by the factors of the L.C.M. not already 


reason, and therefore is 2^.3.23 = 276. To express 


present in the denominator 2.23, and we obtain In the same 

46 276 


I I 

way ^ = - 

69 3.23 


1.2 


_ 4 


3 - 23 - 2 “ 


, and — 
276 12 


I _ 23 


276* 


Adding together all 


these fractions we obtain or on factorizing numer- 

276 276 ® 


ator and denominator, ^ = L = I, after cancelling out the 

2-.3.23 2“ 4 

factors 3 and 23. 

In this way we find it necessary to extend our system of numbers 
to include fractions as well as whole numbers. Another important 
extension is required to include opposites. We know that “hot** is 
opposite to “cold”, “up” to “down**, “forward** to “backward**, 
“positive electricity” is opposite to “negative**, and so on. We there¬ 
fore invent a new kind of number or symbol to represent the results 
of measurements where two opposites are involved; we call it a “nega¬ 
tive number”. If a measurement upward, say, of 4 feet, is represented 
by the number 4, then a measurement downwards of 4 feet is usually 
denoted by the symbol —4, the minus sign showing that it is reversed 
in direction. (It is suggested in Chapter 22 that this could also con¬ 
veniently be written as t, by inverting the symbol 4, and similarly z 
would stand for —2. This apparently trivial device is there sho%\'n to 
be extremely useful.) We can also have negative fractions, as a measure¬ 


ment downwards of 43 feet might be represented as —43 or — 

2’ 

even this \'cry comprehensive system of integers and 
tractions still has its defects, as Pnhagoras observed. We would like 
to be able to express the jength of any line as a number. But suppose 
that IS a triangle with a right angle at B, and that AB and BC 

are both of unit length. What is the length of AC? By Pnhagoras’s 
theorem. AB^ -^ BC^~ = AC^ that is AC^ 2. or AC = ^^3. if there 
IS such a number. But there is no fraction x>y whose square is exactly 
equal to 2, so that we cannot express the length of AC by any whole 

fraction. e can most easily see this bv spliuing both .v 
andy into prime factors; say, .v ^ pqr . . . u and v =- POR . . . U' Then 
if the equation (.v/y)= = 2 was true, this would impl^ tliat = ay^ 
^rrpnrr . . . uu = zPPOQRR . . . iny. But this cqLtion cannotij 
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true, since the left-hand side contains an even number of prime factors, 
the right-hand side an odd number, and we know that a number can 
be factorized in only one way. But although we cannot find any fraction 
to represent the length of AC exactly, we can approximate to it as 
closely as we like. If we draw the figure and actually measure the 
length of AC to 3 figures, we obtain the value 1-41, whose square 
1-412 = 1-9881 is very close to 2. To 4 figures we find AC = 1-414, 
AC^ = 1-999396, and to 5 figures AC = 1-4142, so that AC-= 
1-99996164 = 2-0000 correct to 5 figures. Carrying on in this way we 
can, by taking a sufficient number of figures, obtain a number whose 
square is as near 2 as we wish. 

The usual trick by which mathematicians get round these difficulties 
is to invent an “unending decimal” i 41421356 ... to represent \ 2. 
The figures in this decimal can be obtained by any standard process 
for calculating square roots: since such a process can never terminate, 
we can calculate as many figures of the decimal as we wish. 'Fhe 
interpretation we give to such an “unending decimal” is this: although 
we cannot find a terminating decimal whose square is exactly equal to 2, 
nevertheless by taking enough places in the sequence 1-41421356 . . . 
we can find one whose square is as near 2 as we please. 'I'hus we have 
seen that 1-414^ differs from 2 by less than 001, and 1-4142- differs 
from 2 by less than -0001 : we can continue in this way as far as we 
like. A similar situation occurs with a fraction such as T this is 33 
correct to 2 figures, -333 correct to 3 figures, and we may say that it 
is represented by the infinite (or “recurring”) decimal 333333. . . . 
In short we can say that an unending decimal is a convenient way of 
summarizing a process whereby we can obtain an answer to any degree 
of accuracy. We shall later come across many other such processes, 
especially in the calculus, where it is impossible or inconvenient to 
give the answer to a problem in a finite form, but where it is possible 
to give a method by which we can get as near to the answer as we 
wish. Such a process is called “cornergent”. 

In practice, of course, it is impossible to make a measurement to 
unlimited accuracy: only in exceptional cases can we expect to attain 
anything like y-figurc accuracy. I'or tliis reason it may seem rather 
like splitting hairs to bring in unending decimals. One can imagine 
the reader asking, “Why shouldn’t we simply take all our measure¬ 
ments and calculations to 7 figures only? i hat would be quite ample 
for my purposes, and avoid a great deal of complication.” 'I’he answer 
is that some such line of approach would indeed be pr^ssible, but not 
quite in such a crude form as suggested. In a calculation such as 
V2 — although \/2 1-414214 to 7 figures, and .J = 1-500000, the 

difference \/2 — g = — 085786 is correct only to 5 figures. Two 
figures have already been lost, and the margin of safety may not be as 
adequate as it seems at first sight. We cannot in fact evade the diffi¬ 
culties so easily. 
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We shall not go further into this subject. The reader who is inter¬ 
ested will find many books in which the matter is discussed more fully, 
the best being perhaps G. H. Hardy’s classic. An Introduction to Pure 
Matiiematics (loth edn., 1952, Cambridge). We shall merely state that 
it is convenient to consider unending decimals as numbers on an equal 
status with terminating decimals and integers. This enables us to speak 
of square roots, cube roots, logarithms and many other quantities as 
numbers; if we restricted ourselves to exact fractions that would not 
be possible. In addition we can show that these unending decimals 
can be added, subtracted, multiplied, divided, and otherwise used 
exactly like other numbers: that is indeed almost obvious to common 
sense. For example the equation (.v + i)(a: — 1) = a:* i gives 
(y'2 -j- i)(V 2 — i) = 2 — 1 = 1. To test if this is correct, let us take 
Vs to 5 figures, 1-4142. Then (-v/2 + — i) = 2 * 4 * 4 ^ X * 4^42 = 

•99996164 = I 0000 (correct to 5 figures). If we took \/2 to 7 figures, 
the equation would be correct to 7 figures, and so on. 

Mathematicians usually speak of all decimals, terminating or 
unending, positive or negative, as “real numbers’*. The word “real” 
here is perhaps confusing, as it no longer has any special connection 
with the ordinary meaning of the word “real”. The use of the term in 
this sense seems to have arisen historically by a misunderstanding we 
shall come across later; for the present the reader must simply accept 
it as the usual (and international) phrase. 

It is also very convenient to suppose that the measure of a length 
or a weight is a real number. After all, we can measure a length correct 
to 2, 3, 4, 5 or 6 places. There is no apparent limitation to the number 
of figures we can obtain, apart from the difficulty of making the 
apparatus sufficiently delicate. We can accordingly imagine a length 
as represented by an unending decimal, again in the sense that in theory 
we can determine as many figures of the decimal as we please. Strictly 
speaking this may be wrong, as atomic and quantum theory suggests 
that there is a limit to the accuracy of any measurement. Strictly 
speaking, too, relativity suggests that Euclid’s geometry may be false: 
but it makes little difference in practice, 'rhroughout this book we 
shall talk as if all measurements of quantities such as length, area, 
angle, volume, mass, etc., were ideally expressible as “real numbers”, 
i.e. as unending or terminating decimals, and as if Euclid’s geometry 
was exactly true. Any errors we may commit are quite negligible for 
all or almost ail biological purposes: and it maybe added that many 
(4 our results can be proved by other methods which do not involve 
these assumptions, but which are considerably more difficult to 
follow. 

2.4 Calculating machines 

^■here are many modern devices for cutting down the labour and 
strain of calculation. Not only do these save work, but also they 
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increase accuracy, since they are less liable to error and do not get tired 
like the human mind. Some of these devices, such as logarithms, 
nomograms and slide-rules, will be discussed in later chapters. Here 

we shall consider mechanical devices. 

The simplest form of calculating instrument is the adding and sub¬ 
tracting machine. A very simple, inexpensive and efficient machine 
is the “Exactus”, which consists of a number of bars pushed up or 
down by a stylus: it is operated by hand. Although it is primarily 
intended for addition and subtraction, it can readily be adapted for 

division in the following way. , 

First we multiply the divisor by 2, 3 and 6. Multiplication by 2 
can be done either mentally or manually in the usual way, or else 
mechanically by adding the divisor to itself. To find three times the 
divisor add the divisor to its double. To find the six times multiple, 
double’the three times multiple. Thus supposing we wish to divide 
3017400 by 243. We calculate 

2 X 243 = 486 

3 X 243 = 486 -h 243 = 729 
6 X 243 = 2 X 729 = 1458 

These multiples, together with i x 243 = 243, are set out on a separate 

piece of paper (see Fig. 2.1 overleaf). . •• • i j 

We now put into the adding machine the number to be divided, 
3017400. Looking at the first three figures of this we see that the 
largest of the four multiples which can be subtracted (and not give 
a negative answer) is i X 243 — 243. So i is written do\ni ^he first 
figure of the quotient, and 243 is subtracted, leaving 58. 1 he next 

figure is 7, so that we now have to see how many times 243 will go into 
587 Looking at our four multiples, we see that we can subtract 
2 243 = 486, leaving loi, and giving 2 as the second figure of the 

quotient Bringing down the next figure, we have remainder 1014. 
From this we can subtract 3 X 243 = 729. leaving 285, and then 
I X 243, leaving 42; i.e. the next figure of the quotient is 3 i = 4. 
Similarly we find the last two figures to be i and 6 + 1-7, 

quotient 12417, and the remainder is 69. 

'I'his process can be applied equally well to manual division (without 
a machine): it avoids the trouble of guessing how many times the 
divisor will go at each step, and also the labour of doing a multi¬ 
plication sum each time. 

Alternatively, by writing the multiples of 243 on a separate slip 
of paper, the necessity of rewriting them where they are printed overleaf 
in italics can be avoided: the slip is simply moved along so that the 
appropriate multiple to be subtracted can be seen at once at each stage, 
and the subtraction performed mentally. 
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(IX) 

( 2 X) 

( 3 X) 

(6X) 


1014 

729 

285 

243 

420 

243 

1770 

I45S 

312 

^3 

69 (remainder) 
Fig. 2.1—A method of long division 


243 

486 

729 

1458 


I I 

12316 


3017400 

243 

587 

486 


= 12417 (quotient) 


The Exactus can also he adapted for multiplication in a similar way, 
though rather clumsily. I'hus to multiply 243 by 12417 we shall simply 
do the above process in reverse, working out the i, 2, 3, and 6 times 
multiples of 243 as before, and then setting out 10000, 2000, 100, 300, 
10, I, and 6 times 243 and adding: (100 4 -300 gives 400, i +6 gives 7). 


243 X 10000 = 

2430000 

2000 

486000 

100 

24300 

300 

72900 

10 

2430 

I 

243 

6 

J458 

243 :: 12417 -- 

3017331 


I he other type of machine which is most useful for computation 
is the type which will do all four operations, addition, subtraction, 
multiplication, and di\'ision. 1 here arc a number of ditferent makes 
of such machines, some operated by hand and others electrically, and 
^loice between them is largely a matter of taste, coiweniencc, and cost. 
Fundamentally, however, they all work on very similar principles. 

1 hey have a set of levers <jr keys to put any given number into the 
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machine, (usually) a register indicating this number, to check that it 
has been correctly set up, and also a product register and a revolution 
(or multiplier) register. The fundamental operation is simply that of 
addition: by turning a lever, or pressing a key, the number set up is 
added on to the number already in the product register, which then 
shows the total. By turning the lever twice, or holding the key down 
while the machine does two revolutions, the number is added on twice, 
i.e. multiplied by 2. By a backwards turn, or pressing of a subtraction 
key, the number is subtracted; by repeating this the number is multi¬ 
plied by —2, and so on. The important feature of these machines is, 
however, that the product register is movable. Thus if we put the 
number 123 into the machine, and add it with the product register 
in its normal position, we shall obtain the total 123. 

Product register (initially) . . . 0000000000000000 

Number in machine . . . . . . 123 

Product register (after addition) . . 0000000000000123 

If, however, we move the product register, say, 2 places to the right 
then after addition it will show the sum 12300, i.e. we have multiplied 
123 by 100. 

Product register (initially) . . . 0000000000000000 

Number in machine . . . . • . 123 

Product register (after addition) . 0000000000012300 

By repeating the operation we can multiply by 200, and so forth. 'Phus 
to multiply by 298, all we need to do is to perform 8 additions with 
the product register in its normal position, 9 moving it one place to 
the right, and another 2 additions after moving it yet another place. 
The number 298 will then show in the revolutions register, and 298 
times whatever number has been set up will be added on to the product 
register. 

We can, however, do better. To multiply by 298 in this way requires 
2 -} 9 8 = 19 additions. But 298 is 2 less than 300, and so we can 

multiply by 300 and subtract 2 (or perform 2 backward revolutions) 
in the units place: this involves only 5 revolutions, instead of 19. If 
we denote 2 backward revolutions by an inverted figure z, then we 
can say that to multiply by 298, it is better to consider it as 302. 
Similarly if we are to multiply by 37, it is best to consider it as 4^, i.e. 

4 revolutions in the tens position, and 3 backward ones in the units 
position. 'Phis is known as “short cut” multiplication, and in that way 
any number can be multiplied without doing more than 5 revolutions 
in any one place. It is the standard practice with calculating machines, 
and as we shall see in Chapter 22, the idea (originally due to J. Colson, 
1726) has a wide field of application. 

Division is done by ordinary calculating machines by effectively the 
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same procedure as is usually taught in schools for manual division— 
but most electrical calculating machines have fully automatic division, 
i.e. the machine, once started, performs the whole process without 
further interference. Many machines also have automatic multiplica¬ 
tion, which gives a considerable gain in speed and accuracy. 

Recently there have been constructed some very flexible calculating 
machines using electrical relays or valves (sometimes nicknamed 
“electronic brains”). They are mostly suited for doing a repetitive type 
of calculating involving a fairly simple routine (such as, to take a simple 
example, constructing a table of logarithms), and it is difficult to see 
very many biological applications at present. Also worthy of mention 
are the “differential analysers” which solve difficult differential and 
integral equations by graphical methods; but there are only a few of 
these in existence, and they will not be readily accessible to the average 
biologist. 

2.5 Manual calculation 

Even in a laboratory fitted with modern electrical calculators there 
will be a number of calculations which are most efficiently done with 
pencil and paper: so it is desirable to be able to calculate rapidly and 
efficiently. I'his is not in fact difficult: here are a few hints, which 
may be helpful. 

'The general principle is to say as little as possible. Thus when 
adding 2 d- 3 -f 5 say only the totals, “2, 5, 10”, and not “2 and 3 are 
5 and 5 are 10”, which is much more laborious. When multiplying 
123 by 7 say “1 (2) 6 (i) 8; answer 861”, and not “7 3’s are 21, i and 
carry 2, etc.”. 'I'his requires some practice. It is worth mentioning 
that many rapid calculators find it most convenient to read numbers 
from right to Itft, as that is the direction in which most calculations are 
performed. 'I'hus they think of 28 as “8, 2” and 135 as “5, 3, i”. But 
that is a matter of taste. 

We have already mentioned a method of simplifying division. 
IMuhipIication can be done in a single line bj’’ the method of “cross- 
multiplication”, e.g. 

X456 


56088 


'I'he steps arc as follows (beginning from the right-hand end): 


6 >; 3 

I [ 6 - 2 -t > 



--- 18: 

^3 --28: 



4X1-- 5: 


write 8, carry 1, 
write 8, carry 2, 
write o, carry 3, 
write 6, carry i, 
write 5. 
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2.6 Placing the decimal point 


A number of computers are often a little uncertain about placing 
the decimal point in a multiplication or division sum. 'I'he rule is 
quite simple. First do the multiplication or division in full in the 
ordinary way, ignoring the point. Then, in multiplication, if the point 
is a places from the right-hand end in one of the numbers being multi¬ 
plied, and b places from the right in the other, it will be {a -r b) places 
from the right in the product. In division, if the point is a places from 
the right in the dividend, and b in the divisor, then it will be {a - b) 
in the quotient. I'hus 


123 X 456 = 56088 

so that 

12-3 X 4’56 = 56-088 
[a i, b ~ 2., a ^ b = 

and 

20000/147 — 136 (to three figures) 

so that 

2-0000/14-7 = -136 
[a = 4, b I, a — 6 = 3] 


This rule is the one which will be used when working with a cal¬ 
culating machine. Most calculating machines have indicators which 
can be used to mark the decimal point. 

In this connection it might be useful to revive an old and fioNible 
notation which has fallen into disuse. In a slightly modified form this 
would be to denote Ax lo'" by X^. For example, 231 3 would denote 
231 X 10^, or 231 thousands: 231c would mean 231 millions. 231 ^ 
would be 231 X io“ 2 ,* or 2-31, and 231 would be -231. 'The suffix 
shows how many places to the right (of the right-hand end) we find 
the decimal point. (With a negative suffix, count to the left.) 'The 
rules for placing the point could then be simply written 


X, X 

XJ Y, 


y, = [A>i,o)- 

[XlYl.^yy 


'I'his notation could be extended to writing, say, 2-31 X 10^ as 
2331, or 76*92 / 10^ as 76,92. Hut these suggestions are only put 
forward very tentatively. 'They arc not used elsewhere in this book. 


2.7 Checking arithmetic 

Not only is no human computer infallible, but unfortunately some 
types of calculating machines are liable to occasional error. ICven 
where the machine is perfect, the operator is not; so it is very important 
that all calculations should be checked. 


10 * mciins I / jo* - 


•01 : see Section 6 ,io, 
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When a long calculation is being performed it is usual to insert 
checks at various points in the process of computation. It is not usual 
to check every individual operation, although if the calculation is a 
specially important one, it is desirable to repeat it entirely and pre¬ 
ferably by a different method, or in a different order. (A mere repetition 
of a calculation involves a risk of repeating an error.) 

For short calculations there is a simple method of checking ordinary 
additions, subtractions, multiplications, and divisions, known as 
“casting out the nines”. This is very similar to the method numerol- 
ogists use when finding a person’s “lucky number”. Each letter in 
the person’s name is given a number: A — ly B — C — 3, and so 
on: and all these numbers are added together. Thus John Smith 
would add up to 10 + 15 -h 8 -f- 14 + 19 + ^3 + 9 + “b 8 = 116. 
The digits of the resulting total are again added (1 -f- i -j- 6 = 8), and 
if necessary the process is repeated until finally the name is reduced 
to a single figure (here 8) which is the “lucky number” in question. 
We can apply this process to any number; 183728 would be reduced 
to i-l-8-f-3-!-7-h2-r8 = 29, which again would reduce to 
2 H- 9 = II, and finally to i -f i = 2. This final result we call the 
“reduced value”. Now any addition, subtraction, or multiplication 
sum, if correctly done, will remain true when all numbers are replaced 
by their reduced values. For example, take the addition sum: 

137 reduced value 2 
824 „ „ 5 

73 .. >. I 

/ b ,, ,, 4 


iiio ,, ,, 3 


and we see that 2 5 i -h 4 = 12, with reduced value i -r 2 = 3, 

which agrees with the reduced value of the total. 'Fhe mixed addition 
and subtraction 2432 - 1234 f 256 -• 824 — 56 -- 2222 becomes 
2 — 1 4 5 — 2 = 8, whicli checks: and the multiplication 123 X 

436 == 560S8 becomes 6 x 6 — 9, which checks, since 6 N. 6 = 36 
has reduced value 9. To check a division, we have to rewrite it as 
tlii idciid- divisor quotient -‘. remainder. 'rhus the division of 
3017400 by 243 gives quotient 12417 and remainder 69, or 3017400 = 
243 ■ 12417 -f 69. losing reduced values this becomes 6 9 x 6 6, 

which checks on reduction. 

The reason why this ch. ck works is essentially that the “reduced 
value” of a number is simply the remainder obtained on dividing the 
number by 9 (except that numbers which divide exactly by 9 have 
reduced value 9 and not o). 1 his is because the numbers 9, 99, 999, 

etc., all divide exactly by 9, so that 10 - ^ 9 i, 100 99 ^ i, 1000, 
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etc., must all give remainder i on division by 9. By a similar argument 
2, 20, 200, 2000 ... all give 2 as remainder; 3, 30, 300, 3000 ... all 
give remainder 3, and a number like 213 = 200 + 10+3 gives the 
same remainder on division by 9 as does 2 + 1+3. ^ number A 

has reduced value a, it is simply a plus a multiple of 9, so that we can 
write 

A = a gu 

where u is an integer. Similarly if B has reduced value b, then B = 
b + gv. Therefore the total (A B) = a gu b-\-gv 

= (rt + *) + 9 (m + r) 


differs from (a + ^) by a multiple of 9, so that {A + B) and (a + b) 
have the same reduced value. Considerations like this explain why the 
check of casting out the nines works with addition or subtraction. 
For multiplication we have 

AB = (a + gu){b + gv) 

= ab g ub g av -\~ uv 
= ab g {ub + av + 9 uv) 

so that AB and ab differ by a multiple of 9, and must have the same 
reduced value. 

In casting out the nines we can ignore multiples of 9. Thus a 
number like 3994 can be reduced to 3 -I 4 — 7. ('Fhe direct calculation 
would be 3 4 9 - g \ 4 — 25, 2+5 7.) For 2361, we can ignore 

the 3 and 6 which add to 9, and take the reduced value to be 2 -I- i = 3. 
Again if we try to check the subtraction 210 — 35 = 175 by casting 
out the nines, we find that 210 -- 35 reduces to 3 — 8 = —5, whereas 
17s reduces to 4. 'I’his is a perfectly good check, because - 5 and 4 
difer by 9, and must be considered as equivalent in terms of reduced 
values. 


“Casting out the nines” is, of course, not an infallible check, though 
a very useful one. It is possible to make an error, or series of errors, 
which may pass the test. In particular it is easy to interchange 2 figures, 
writing for example 2659 instead of 2569, and this rather common 
type of error will not affect the reduced value and so will not he 
detected. A rather better check is that of “casting out the elevens”. 


In this check the reduced value is calculated by alternately adding and 
subtracting the figures, starting from the r/^^/i^-hand end. 'I hus 137 
will reduce to 7 — 3 + i = 5. and 824 to 4-24-8 10, which 

further reduces to o — 1 = — i. 'I'hc addition sum 137 4 824 4 73 
4 - 76 =1110 reduces to 5 — i — 4 — i — i, which is a perfect 
check. In this method of checking it is multiples of eleven which arc 


ignored, rather than multiples of nine: in all other respects it is used 
exactly like the “casting out the nines” check, and can be used for 
addition, multiplication, subtraction, and division. It will detect an 
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interchange of two adjacent figures, provided that there is not by bad 
luck a compensating error elsewhere. It follows that a calculation 
carefully performed and checked by both the nines and elevens methods 
is most unlikely to be wrong. 

2.8 Punched cards 

Large quantities of data are tiresome to record and sort by hand. 
If they are put on punched cards then the sorting and at least part of 
the computation can be done mechanically, with a resulting improve¬ 
ment in speed and accuracy. 

I'here are two well-known types of punched cards: the “Cope- 
Chat” type, and the other type which is represented by the “Hollerith” 
and “Powers-Samas” systems. 



Fig. 2.2 A Copc*Chat card, punched for age 54 and male sex 


’The Cnpc-Chat system is . mplicity itself: the only mechanical 
npiipment rcquirctl consists of ; pack of cards, a punch, and a knitting 
TK-L'dte. I'.ach canl has initially a tow t'f holes puttched round the edge. 
1 iie intormation is rec<»rdcd ojt the cards by using the punch to cut 
V-sluipeil slots tr**in certain ht.»les to the edge of tile card (Fig. 2.2). 
W hen a cards has a needle nm through any particular hole, 

and is tliea gei\tly shaken, .ill tliose cards for which tiiat hole is slotted 
will be tree to tall otf the needle, while those cards which have the 
hole lelt uiislotied will he retained. ’1 hus sorting can be done easily 
aiul rapidlv. 


All kiiuls ot ii'.foi'niation 
oidy necessary t<j arrange a 
ticular piece of information 


can be put oit a Cope-Chat card: it is 
ntethod of “coding” by which any par¬ 
ts represented h\' the slotting of certain 
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holes. Such items as sex, place of birth, the first two letters of a sur¬ 
name, can easily be recorded and sorted; we could separate out from 
a pack such items as “all cards representing males” or “all cards repre¬ 
senting females bom in London”, and these might be counted or 
sorted further. The easiest way to record numbers is to have holes 
corresponding to the numbers i, 2, 3, 6, 10, 20, 30, 60, 100, 200, 300, 
600, etc. 12 will then be represented by slotting the holes “2” and 
“10”, and 8 by slotting the holes “2” and “6” (since 2 -(-6 = 8). 

Using this system of punching it is an easy matter to find totals. 
Suppose that a number Ai, say, is punched on the first card, a number 
A 2 on the second, A 2 on the third, and we wish to find the total 
T = Ai -j- A2 + A^- . - . We first extract with the needle all cards 
slotted in the hole “i” and count them: let us say that there are 
such cards. Since each contributes i, together they contribute 
iWj = to the total. These cards are now put back into the pack, 
and all cards slotted for “2” are taken out, and counted. If there are 
^2 such cards, they will contribute 2^2 the total. We replace these 
cards, take out all cards slotted for “3”, and continue in that way. 
I'he required total will be 

T= ni -h 2^2 + 3«3 H- + io 7 /,o -I- 2o;/2o T . . . 


If wc have two sets of numbers punched on the cards, .* 1 , and B, 
(say) on the first card, A 2 and B j correspondingly on the second, and 
so on, then we can find (though rather clumsily) the sum of products 
S = AjBi A Ji-i + A Jij -i . • . • 'I'his as we shall see later is a 
calculation of importance biometrically, used in calculations of 
variances, standard deviations, and correlations. 'To do this we first 
separate out all cards slotted for “i” in the representation of the 
number A. By the method already described, we find the total of all 
the B numbers on the separated cards: let this total be 'I\. 'These 
cards are then replaced, and those slotted for “2” in the representation 
of A are separated. 'The total of the B numbers on these cards is 
found to be, say, T *l'hc cards are replaced, and the process con¬ 
tinues. 'The surti of j>roducts 



AJi, 1 A.B 2 \- A.B^ + . 
r, zT.. i yi\ \ f)T« f 


loT.o 



■ 


There is no need for the numbers “/i” to be difierent from the numbers 
**A”: the process can equally well be used to ex aluate 

A^A^ ) A ..A i A ^A 2 * . ■ ■ .-Ij" } /L' I A 2" 


where is a numl^er recorded on the first card, A 2 the corresponding 
number on the sect>nd, A 2 that on the tlurd, and so on. 

The Cope-Chat sy.stem is very convenient for dealing with data 
punched on to a moderate number of cards - up to a few hundred. It 
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has the good points that it is simple, cheap, and flexible; and also it is 
possible to write extra information on the face of the card. Its bad 
points are that the counting of cards by hand is rather slow, and that 
only a limited amount of information can be recorded on a single card 
with about 150 holes. 

The Hollerith and Powers-Samas systems are considerably more 
rapid than the Cope-Chat, and can deal with more extensive material, 
but they require special mechanical equipment. The two systems, 
Hollerith and Powers-Samas, are very similar in general outline, but 
differ in points of detail: as a rule the Hollerith, which is electrically 
operated, is the more flexible of the two. A Hollerith card (Fig. 2.3) 

OOODOOlSOOOOOOQOOOODOOaDBOOOQOOOOOOOOOOOQOOQOOOOOQ 

I 2 ) « i S 1 9 9 10 U I] I) U l» 16 12 11 19 20 2> 22 2) 24 » K 27 23 n n )l }2 3) 34 33 3» 37 38 19 4J 4i C 43 44 <S 46 4 ] 43 43 X 

222222222222222222222222222222272122222222222222 
33333333333333333333333333333333333333333333333333 


44444444444444444444444444444444444444444444444444 

555S555S555555555S555555S555555555S5555S55S5555555 

66666666666666666666666666666666666666666666666666 

77777777777777777777777777777777777777777777777777 

86688888818888888688886866688686888688888866888866 

99999999999999999999999999999999999999999999999999 

^ ^ I CO iVi-i**** M n » j* V n )4 » v h « 4t 4) u a h 4j i%4sv> 

^^L.LkKI I H ^ ~ 35^ ^Hi ftintiiH rACi/iAiiNC cOi. L*ro. v«CToaiA 

Fig. 2 . 3 —Part of a Hollerith card punched for . . . xiooaSxiK . . . 


has up to So columns, each column containing the lo digits o to 9, 
and two extra places A" and Y for occasional use when required. The 
information is recorded by punching holes through the appropriate 
digits: thus the number “123 ” would be recorded by punching through 
I m the first column, “2” in the second, and “3” in the third. 
I unched cards can be sorted and counted at the rate of 400 cards a 

minute, and a number of other operations are possible, including 
addition and multiplication. 


2.9 The uniqueness of factorization 

\\ e now gi\c the promised proof that any positive integer, greater 

prime factors in only one way. 

Ihis theorem depends on a simpler lemma that if A and B are 
two positive integers such that the product AB is divisible exactly by 





BRUSH UP YOUR ARITHMETIC 25 

a prime p, then at least one of the two integers A and B is divisible 
hyp. 

To see why this lemma should be true let us consider the meaning 
of the product AB. This can be considered as the number of objects 
arranged rectangularly in A columns and B rows, as in Fig. 2.4 (where 

^ ^ 7, B = 6). 

A columns 


/X X X X X X X 
I X X X X X X X 

B rows ! X X X X X X X 

' X X X X X X X 

X X X X X X X 

X X X X X X X 

Fig. 2.4—^The product AB 


Now let us number off these AB objects in order as 1, 2, 3 ... up 
to />; I, 2, 3 . . . up to p; 1, 2 . . . (starting again from i at every pth 
number), and so on till we have exhausted the rectangle (see Fig. 2.5, 
where p is taken as 3). Then every time we come to p in this counting 
we shall have numbered off a multiple of p. Since AB is supposed to 
be exactly divisible hyp, the last object (in the lower right-hand corner) 
will be numbered off as p. In fact, more generally, if the Rth row of 
this arrangement ends with the numbering p, then (and only then) is 
AR divisible by p. I.et the first row which ends with p be the P'th. 
Then in the (F -\- i)th row the numbering repeats exactly the number¬ 
ing in the first row; and this repetition continues. It is not until the 
(2F)th row that we get another row ending in p. 'I'he whole pattern 
then begins to repeat once again, and the (3^')^^^ •s the next that 
ends in p. We thus see that the only numbers R for which the Rth 
row ends in p, i.e. AR is divisible by p, are the multiples F, 2I', 3/*’. . . 
of F, where F is the smallest possible value of R. 

Now we know that both AB and Ap are divisible exactly by p, so 
both B and p must be multiples of F. Hut p is supposed to be prime: 
that is to say, by definition, its only factors are i and p. So either 
F = 1 or J'— p. If P' I, then since AP' is divisible by p, we see 
that A must be divisible by I'. If on the other hand F = p, then since 
.6 is a multiple of /', it must be a miiltiple of p. This establishes our 
result that at least one of A and B must be divisible by p. 

This result can be readily generalized. If a product ABCD is exactly 
divisible by a prime number p, then at least one of the factors A, B, 
C, and D is divisible by p. ('I’his will hold for any number of factors.) 
For supposing that ABCD is divisible by p, on writing this as A . {BCD) 


t 2 3 I I 2 3 I I , 

2 3 1 I 2 3 1 I 2 -i^= 3. 

3 h 2 3 h 2 3 I I 

1 2 3 1 I 2 3 I I 

2 3 1 I 2 3! I 2 

3 ! I 2 3 j I 2 3 1 

Fig. 2.5—Test for divisibility 

by p( - 3) 
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we see that either A or BCD (or both) is divisible by p. If p divides A 
we have proved the required result. If not, then p must divide 
B . {CD), and therefore B and/or CD is divisible by If B is divisible 
hy py then we have proved the required result: if not, then CD is divisi¬ 
ble by Pj so that at least one of C and D has /> as a factor. This proves 
that at least one oi A, B, C, D is divisible by^. 

To prove our main theorem we need one further result, that if 
p divides P, and both p and P are prime numbers, then p — P. This 
follows at once from the definition of a prime, that P is exactly divisible 
only by i and itself. 

Now suppose that is a positive number (greater than i) which 
has been split into prime factors in 2 ways: let us say 

X = pqrst ~ PQRSTUV . . . (2.1) 

where p, q, etc., are all prime. We then know that PQRSTUV is 
divisible by />, so that at least one of the prime numbers P, Qy P, S, 
Ty Uy V is divisible by py and so must be equal to p. For example, 
suppose R was equal to p. Then we can rewrite equation (2.1) as 

pqrst = PQpSTUV. 

Dividing both sides of this equation by />, it becomes 

qrst = PQSTUV. 

A similar argument now shows that at least one of the remaining 
prime numbers P, Q, S, T, U, V must be equal to q\ let us say, U=q. 
'The equation now becomes 

qrst = PQSTqV 

and on dividing both sides by q, 

rst PQSTV. 

Wc can Continue in this ty, showing that prime number on the 

left-hand side also occius somewhere on the right. Finally, when we 
have exhausted all the prime numbers on the left-hand side, which 
then becomes 1, the right-hand side, being equal to the left, also must 
become i, so that we must also have exhausted all the primes on the 
right. 1 hat means that the prime numbers on the right-hand side are 
simply those on the left arranged in a different order; so that apart 

from the order the factorization into prime numbers can be done in 
one wav only. 


PROBLEMS 

0 ) ^^heck the following sums by casting out nines and elevens^ 
Which are correct? 
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{a) 7123 -h 604 — 6215 = 1152 

(i) 30*74 + 1*38 — 20 53 + 40*10 — *67 = 51*02 

(t) 123 X 56 = 6888 

(d) 72 3 X 13*4 = 966*82 

(e) 162^ = 26244 

(/) 25*3 X II-I + 7-2 X 6*3 - 51 X 42-4 - 109*95. 

(2) Show that any exact square must have a reduced value o, i, 4, 
or 7 on casting out the nines, and o, i, 3, 4, 5, or 9 on casting out the 
elevens. 

(3) Show that any exact square must end in one of the figures o, 
I» 4 > 5 f 6, 9. Which of the following numbers are exact squares: 62,500; 
*37.926; 26,451; 41,616; 235,724; 40,903; 1,210; 5,287? 

(4) Show that any exact cube must have reduced value o, i, or 8 
on casting out nines. Which of the following numbers are exact cubes; 
12,531; 110,592; 176,024? 

(5) 1 ( A, B, . . . G are all positive integers, then any positive integer 
which divides exactly into all of them is called a “common factor”. 
Show that any factor of a common factor is also a common factor. 

(6) By splitting the numbers A, B, . . . G into prime factors, show 
how to find their greatest common factor [G.C.F.]*. find the greatest 
common factor of 120, 2100, and 144. Every common factor is a factor 
of the G.C.F. Why? Find all the common factors of 120, 2100, and 
144. 

(7) Similarly show that any multiple of a common multiple of 
A, B, ... G is also a common multiple, and that every common multiple 
is a multiple of the least common multiple [L.C.M.]. 

(8) Let a, b be two positive integers. Let c be the difference between 
them, that is, £■ = a — 6 if a is greater than b, and r = Z» — a if A is 
greater than a. Show that any common factor of a and b is also a com¬ 
mon factor of b and c, and conversely; and in particular that the G.C.F. 
of a and b is equal to the G.C.F. of b and c. What happens i( a ^ b> 
If c o, let d be the difference between b and c. '1 hen G.C.I*. (r and 
d) = G.C.F. {b and c) = G.C.F. {a and b). What happens if c = f/? 
Show that by continuing in this way we can find fairly quickly the 
G.C.F. of any two numbers, even where we cannot conveniently split 
them into prime factors. Find the (i.C.I*. of 45 and 84 in this way. 

By imagining a and b split into prime factors, show that 

(G.C.F. of a and b) x (L.C.M. of a and b) " ab. 

Hence find the L.C.M. of 45 and 84. 

• Also culled the “ hi«hcst common factor ’’ (H.C.I-.l, 
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(9) By imagining the positive integers r, s, and t split into prime 
factors, show that the L.C.M. of r, s, and t is the least common 
multiple of L and t, where L is the least common multiple of r and y. 
By referring to the previous question, show that this enables us to find 
the L.C.M. of any three numbers, even where we cannot in practice 
split them into prime factors. Extend this to a method for finding 
the L.C.M. of four or more numbers. 



CHAPTER THREE 


SOME POINTS IN ALGEBRA 


3.1 Constants, variables and equations 

We suppose that the reader has a working knowledge of most points 
in elementary algebra, as for example how to solve an equation like 
3^; q- 8 = II [3.1: = II — 8, AT = i] or to multiply expressions such as 

2xy + 

(jc — yy= X- — 2xy 
{x + >')(.V —y) = — y- 


+ y)^= x^ + 


The usual definition of algebra in elementary textbooks is that 
“ algebra is the science of operating with numbers, where the numbers 
are represented by letters”. However, we can expand this definition 
to make certain useful distinctions between the different kinds of 
numbers represented by the symbols. 

Some symbols represent fixed unvarying numbers, or “constants”. 
Examples of such constants are tt, the ratio of the circumference of a 
circle to its diameter (derived from the Greek word neplfierpo^, peri¬ 
meter or circumference), e, the “base” of natural logarithms (Section 
6.13),^, the number of joules to one gram-calorie, and K, the dissociation 
constant of any given chemical reaction. But more often a symbol will 
represent a varj'ing quantity, or “variable”. Such a quantity may quite 
literally be changing from one moment to another, or from one day 
to another. Examples of such quantities are the population of the 
world, the weight of a particular man or animal, the amount of chloro¬ 
phyll produced by a particular plant. Quantities such as^, the attraction 
of gravity, or h, the height above sea-levcl of the earth’s surface, vary 
from place to place rather than from time to time; and the temperature 
and pressure of the air vary, both with the place and time of measure¬ 
ment—and the actual observed result may also vary slightly with the 
instrument used to make the measurement. This distinction is, of 
course, a relative one, though none the less useful on that account; it 
we are concerned w'ith experiments made at various points on the 
earth’s surface, then g, the acceleration due to gravity, w'ill vary slightly 
from place to place. But if the experiments arc all performed in one 
place then g will be constant. 

The equations which express the relations between these quantities 
will likewise be of several kinds. An equation such as (v + y)- 
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X® + 2xy is a consequence of the laws of arithmetic, and is true 
for all numbers x and y whether they are fixed numbers, e.g. [i + 

= I 2y/2 -f (V2)® = 3 + 2\/2, or whether they are variables. An 
equation of this sort is often called an “identity”. Other equations 
may represent “laws of nature”; e.g. that if m is the mass of a volume V 
of gas, measured in molecular weight units, P its pressure, and T its 
temperature, then PV — mRTy R being the “gas constant”. This 
holds approximately for most gases, and will be true whether P, V, 
and T are varying quantities or constants. Finally we may have relations 
which are found to hold good in particular circumstances only, as for 
example a single observation in an experiment, “T — I5‘7® C”. In 
such a particular observation T is, of course, a constant, not a variable: 
this remark may sound trivial, but, as explained in Section 8.13, 
neglect of it is a common cause of error in differential calculus, where 
we are dealing with vary’ing quantities. 

3.Z Polynomials 

'I'he simplest kinds of algebraical expression are those containing 
no fractional expressions with letters in the denominator, but only 
powers and products of powers, as for example, uv u v 2y 
3 + 2-v + .v“, P^ + 3PQ" + 3^'!? + 

Such expressions are called polynomials (from the Greek, “many 
terms”) and the parts of the expression which are separated by -j- 
and — signs are called terrns. \n uv u v 2 the terms are tiVy 
w, t’, 2. These polynomials are particularly important because of their 
great mathematical simplicity. As we have said, we shall suppose that 
the reader is familiar with the ordinar>' laws of algebra: but for con¬ 
venience we may summarize them as follows. 


Lazes of addition and multiplication 

(i) Since algebraical expressions represent numbers, any two can 
be added or multiplied. 


(2) Additions can be done in any order, e.g. 2 X-y + (x 3V) = 

(33' -i -'■) d- 2-v-;^' -- .V + 33' + 2x~y. 

(3) Multiplications can be done in any order, e.g. -xab (a + b) = 
3 {a -I- b) ba = {^b) a (a 4- b). 


( 4 ^) [- C 4 ... 4- O) ^ - AB 4 - AC 4- . . , + AD where 

- 4 , ]j, C- ... D may stand for any numbers or algebraical expressions, 
e.g. (.V -ry) {it- 4- uz- 4- ^r) --- (.v 4- y)u^ -r {x y)uv + (x 4- v)z. 

(4b) (/^ 4 - C 4- . . . 


4- D)A = BA CA 4- 


4- DA : for 


example, 3.-! -f 5.4 4-10.4 - (34- 54- io ).4 = i 8 . 4 . This law is 
equnalent to (4a) by virtue ot law 3, that it does not matter in which 
order we perform a multiplication. 
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(5) Index laws (i.e., laws involving powers A^): 

(5a) A”'A’' = 

(5b) {ABY = A" 

(5c) (A'"y‘ = A"'" 

These are really consequences of law 3, by virtue of the definition 
of a power A'^ as a repeated product AAA . . . A of A's. 

(6) For any number or expression A 

oA = o 

A -h o = lA ^ A' A 

(7) If ABC . . . — o, then at least one of the factors A, B, C . . . is 
zero. 


All other operations of algebraic addition and multiplication can 
be analysed into a repeated application of these seven simple laws, 
although they will probably have become so familiar that the process 
is practically instinctive without such an analysis. Thus the operation 
A^A~2A can be analysed as A-\-A -iA-\-iA [law 6] = ( i-i-1).-i 
[law 4b] = 2A ; and the operation A.A = A^ as A.A = A^.A^ [law 6] = 
[law 5a] = A^. 

Similarly (x + >’)- (x + }') (x + >’) 

{x + y)x d- (.V + y)y 
= (x.x + j..v) -f {x.y + y.y) 

- (.v2 + xy) -f- {xy -1- y^) 

— X’ -f- (a^* -{- xy) + 

= .v''^ d- 2 a;>’ d- y'^ 


law 4a] 
law 4b] 
law 3 
law 2 


Subtraction does not need any further laws; all we need say is that 
X —y can be written as x -f (—1);)', from which all the properties of 
subtraction follow. For example, 

X — X =I.V d- [1^"' (“O]-'' 

- o.x - o [law 6]. 

This precise statement of the laws of algebra is not merely of purely 
theoretical or academic interest; later on we shall find it useful to bring 
in symbols (such as matrices) which obey fairly similar but slightly 
different laws, and it is helpful to be clear as to what'exactly can or 
cannot be done with algebraic symbols. 

'rhere are a few equations which are of such common occurrence 
that it is wise to commit them to memory. 'I'hese are 


(x d- y)^ .V^ d- 2.Y>’ 

• (3-0 

(x — y)^ = — 2 xy d y'^ ■ 

• (3-2) 

(x 4 y) (x — y) - x~ — y'^ . 

• (3-3) 

(x d- y)'-^ --- x^ d- 3xty d 3.vy^ d . 

- (3-4) 

4, y-J _ _|, y’^ ^^2 _ J^y 4. yjj 

• (3-5) 

x3 yj ^ xy y y-) 

y^ (x — y) (x* 4 xy d xy^ d Ay'« t y‘) 

• (3-b) 
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and in general 

_^n ^ (*n-l + *n-2y + + . . . +>”-*) (S-?) 

All these formulas can be verified by direct multiplication, and they 
are identities, that is, they are true for all values of x and y without 
restriction. 

3.3 One-variable polynomials 

Polynomials containing only one variable x are of special importance 
and simplicity. 

Consider first the following table, derived from a survey made by 
Karl Pearson and A. Lee, Biometriha, 2 (1902), p. 357, on the inheritance 
of height and other physical characters, x here represents the height 
of a father (rounded off to the nearest inch), and y the average height 
of sons of fathers of height x. 


Table 3.1—Inheritance of height 


Father^s height 
{inches) 

X 

Average 
son's height 

y 

y — 

1 

59 

647 

35-2 

60 

65-6 

35-6 

61 

66*3 

35-8 

62 ' 

656 

34-6 

^3 

66-7 1 

35*2 

64 

1 66-7 ; 

34-7 

65 

672 

34-7 

66 

676 i 

34-6 

67 

680 ! 

! 345 

68 

691 I 

35*1 

69 

694 

34*9 

70 

697 

34-7 

/I 

1 70-5 

35-0 

72 

i 70-9 

1 

i 34-9 

73 

. 72-0 

: 35-5 

74 

71-5 

i 345 

75 

717 

34-2 


We can plot these points graphically in the usual way, measuring .v, 
the father s height, horizontally, and jy, the average son’s height, 
vertically. 1 he points we obtain are represented by black circles in 

Fig- 3-*- 
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But we notice that if we subtract half the father’s height from the 
son’s, obtaining y — this difference is nearly constant and equal 
to about 34*8 inches (see Table 3.1). Naturally in a limited sample 
taken from the general population we should not expect absolute con¬ 
stancy in this difference. There will inevitably be random fluctuations 
in the values, especially at the ends of the range, where the number 
of specially tall or specially short fathers will be small. We shall have 



more to say about this later (Section 21.10). On tlic whole, however, 
the relation y — = 34 8 is reasonably accurate for all the values of 

X and^', and we can write this as 

y hx + 348 .... (3.8) 


We can also plot on the graph the “theoretical” values which would 
be obtained if the relation y = I- 34 8 was absolutely accurate. 
Thus when x = 60, y = 64-8. 'I'hese values are represented by the 
white circles, and it will be seen that they lie on a straight line. This is 
true even for fractional values of x; when .v = 70 6, y 70 2, and this 
also gives a point on the line. 'I'his line is a picttire or “graph” of the 
polynomial ^x 34 8, in the sense that e\ery pair of values x and y 
for which y ~ -i- 34 8 gives w hen plotted a point on this line, and 

all points on the line represent values ol x and>’ for whichy - ix^-34•8. 
From this graph ((ireek f>raphe, “writing” or “picture”) W'e can see 
at a glance how’ the expression i.v I- 34 8 behaves for various values 
of x, and \vc can use it to predict the probal^le height y of the son, 
when we know the height a: of the father. 'I’lie upward slope of the 
graph as we proceed fnmi left to right shows that the taller the father 
is, the taller the son will be (on the average). 'The fact that this is only a 
mild upw'ard slope shows that the average son’s height does not increase 
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as rapidly as that of the father; an increase of i inch in the father’s 
height only gives an increase of ^ inch in the son’s. 

Pearson and Lee showed that a more accurate relation betw'een the 
height of father and height of son is given by 

y = 33*73 + -516 X. 

This, however, hardly differs from the formula y = 34-8 + when 
plotted graphically, at least in the relevant ranges of values of x and y. 

The general form of a polynomial of the first degree (one contain¬ 
ing Xy but not x^y x^y x*y etc.) will be .<4 4 * Bx, where A and B are fixed 
numbers; and this can be represented graphically by plotting the 
value of y ^ A Bx for various values of x. Examples of such 
polynomials, either approximate or exact, are as follows: 


(i) X = temperature of n moles of gas at fixed pressure P, 
y = volume of the gas; 

y = [nR/P] X = A -h Bx 

where A — o and B = nRjP. 

(ii) X = temperature on Fahrenheit scale, 
y = temperature on Centigrade scale; 


y = 


160 c 

- - X = A 4- Bx where A = 

9 9 


160 


B=^^ 


(Hi) 


(Iv) 


a: = year between 1891 and 1901 inclusive, 
y = population of England and Wales in year x\ 
y — — 632,850,000 + 350,000 .V (approx.). 

X ~ tension in an elastic thread or spring, 
y = length of the thread; 

y = A Bx (Hooke’s law) 
where A = natural length under no tension 
B = A X modulus of elasticity. 

All these expressions will be found to gi\'e straight-line graphs: for 
that reason they arc known as Iwear expressions (Section 5.11). The 
word linear or linear in .v ’ applied to an algebraic expression means 
that it is a polynomial containing x but no higher power of .v, such as 
x- or 

If we plot various graphs of the form y A 4- Bx we can see the 
effect of changes in the constants .4 and B (Fig. 3.2). When B is positive 
the graph slopes upward to the right; when negative, downwards. If 
B - c, the graph becomes simply >- -•= A. and is horizontal. Different 
giaphs with the same \'alue of 7 ? are parallel, for example y = 

y . ^ j _l_ phus B determines the slope of the graph, 

and indeed it is usual to consider B as the measure of the slope, (For 
the present we shall not attempt to give formal proofs of these asser¬ 
tions, but simply explore the subject pictorially.) The number **A” 
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is even simpler to interpret: it is the value of y when x = o, i.e. the 
distance from the “origin” or point .v = o, jy = o, at which the graph 
cuts the vertical line x = o. This vertical line is known as the “j-axis”, 
and the corresponding horizontal line j = o is the “.v-axis”. Thus the 
effect of changes in A is simply to move the graph as a whole vertically 
upwards or downwards, as is shown by a comparison of the three 
graphs y = ^x, y = 2 + lx, y = — 1 + ix. When A = o the graph 
must pass through the origin x = o, y = o. 



(i) y = ix (iv) y 2 

(ii) y 2 + ix (V) y = I H 2 x 

(iii) y = —X + ix (vi) y = 15 — 2 * 

Besides giving a general picture of the behaviour of the polynomial, 
graphs arc useful in two ways. For each value of .v we can read off 
the corresponding value of y, and for each y the value of x. This 
reading by eye is not usually very accurate, but quite often it may be 
good enough, and more convenient than numerical calculation. But 
it is quite an easy matter with a linear relation to find the value of .v 
for any given y: this simply amounts to solving a linear equation. 'Fhus 
']( y = A Bx, we have by transferring x to the left-hand side and y 
to the right, ^Bx - A —y; on dividing by - B, .v {—AjB) + (i/Zi)>- 
which is again a linear relation. 'I’hus our relation between father’s 
height X and average son’s height y givesy — lx + 34 8, jc -- 2 y — 6 () 6 . 
Here, however, a rather unexpected word of warning is needed. In 
this relation a: is the father’s height, and y the average of the heights 
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of sons of such fathers. When we write the relation as x = zy — 69-6, 
there is a definite temptation to suppose that it will not matter much 
if we take y as the actual son’s height, and x as the average height of 
fathers having such sons. If the relation between height of father and 
height of son was an exact one, so that the son’s height depended only 
on the father’s and on nothing else, we could change the meaning of 
our symbols in this way without much harm. But, as is common 
knowledge, one father can have sons of very different heights, and it 
is only when we average the heights throughout a large population that 
we get a simple and exact relation. In fact, it turns out that if Y is 
the son’s height, and X the average father’s height, the relation is much 
more like X = 333 than x = zy — 696, which is a very different 

graph. We shall return to this point later (Section 21,10). 


3.4 Quadratic polynomials 

Graphs of expressions containing x^ are also important. These will 
be of the general form y = A Bx + C.v®, and are known as “second 
degree” or “quadratic” polynomials. Examples are the area jy of a 
square of side .v, y — at-; the area y of a circle of radius x, y = tt.v-; 
in general, the surface area y of a body of given shape will be propor¬ 
tional to the square of its length .v, y = Cx-: this will have importance 
in the consideration of the vaiy'ing amounts of substances which can 
diffuse through the walls of cells of different sizes. The distance y 
travelled by a falling body in time x, if it begins with an initial down¬ 
ward velocity is y= jix -f- where the acceleration of gravity, 

is 9 81 mctrcs/sec- in metric units. The heat or power produced by 
an electric current is proportional to the square of the current. Another 
example is the relation between the length a* of a foetus (in metres) and 
its age y (in months) which is y = 2 30 + 9*0 a + iz S x- over a 
certain period, according to Scammon and Calkins {Growth of the 
hinnau body in the pre-natal period, 1929, Univ. Press, Minnesota). 

3*3 shows the graphs ot a number ot such polvnoinials. The 
equation (i), y ■- -■ 29-43^ - 4-905V-, represents the height y (in metres) 
('t a body projected upwards witli velocity 29*43 metres per second, 
A being the lime in seconds after the body leaves the ground. (This is 
a particular case of the equation _v tix -f referred to above. 

However, here y refers to the distance measured upwards, not down¬ 
wards, so u is the initial upward velocity, 29-43 ni/sec, and g — 
—4-905, the minus sign showing a downward acceleration.) Eq^uation 
(‘i), 3' - 10 -f- lo.v -- 4-905A-, gives the height of a body thrown up 

^y^lh velocity 10 metres;sec and starting from a point 10 metres above 
tile ground. It will be seen that all these graphs are of a very similar 
shape, called a parabola. If the constant C in the formula y = A + 
Bx 4 “ Cx~ is positive, they ha\e some\vhere a lowest or viinimum point, 
and the graph curws upwards symmetricallv on either side of this 
point. For example r 18 ~ 8.v ~ 2.v= has a minimum when a = 2. 
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If C is negativ’e the graph has a peak or maximum point, and curves 
downwards on either side. For example, the graph y = 29-43^: — 
4*905a; 2 has a maximum at jc = 3; this means that a body projected 
upwards with velocity 29*43 metres/sec will reach its greatest height 
after 3 seconds, and will then begin to fall. 

We know that if x is large, x^ is very much larger: when x = 100, 
= 10,000, and when x = —2000, x^ = 4,000,000. This shows us 
at once that if x is very large, either positively or negatively, then in 
the formula y — A Bx + Cx^ the values of the terms A 4 * Bx will 



3 ' 3 —Graphs of the quadratic functions 

(i) y 29 43X —4 905X* (iii) y = i8 — 8x I zx* 

(ii) y = 10 ( lox — 4 90SX* (iv) y = 45 — zox 4- 4X* 


become small in comparison with the term C.v®. ('I’his would not be 
so if C was exactly zero, but then we would be dealing with the linear 
expression A Bx which we have already discussed, and not with a 
quadratic one.) 'I'hc terms A + Bx may actually be large in themselves, 
but relatively speaking they will be negligible, and in that sense we 
may say that y - Cx"^ approximately for large x. But must always 
be positive, so that y must be positive for large values of x if C is 
positive, and negative if C is negative. 'I'his explains the behaviour 
of the graph at the left- and right-hand ends of the scale. To study 
the behaviour of the graph at intermediate points a special trick is 
used, that of “completing the square”. Suppose wc wanted to investi¬ 
gate the behaviour of the expression^ - - 4X H X'^. 'I’his can be written 
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as y = 2x + 2X X-, and can be pictured as in Fig. 3.4 where x^ is 
represented as the area of a square of side Xy and the two products 2x 
as rectangles of sides 2 and x. This figure suggests that if we add the 
unshaded square of area 2* = 4 we shall get a square of side (2 + x); 
that is confirmed by the identity (2 + — 4 + (^a* + which we 

get by direct multiplication. The original quantity y=^-\-x^ can 
therefore be written as _y=(2+A)^—4. Now (2 + xY being a perfect 
square is always positive, except when x= —2 when it is zero; and 
therefore y is greater than —4 except when x = —2, y = —4. Thus 
this expression has the minimum value —4 and the minimum occurs 
when A = —2. 


2 



3-4—“Completing the square'* for 4 X -r x® 


The general quadratic expression will be 

y = A + Bx + Cx^ - C + ^x + .v=) 

where A, U and C are constnn's. A similar argument suggests that this 
should be compared with the b luare 









for when this is multiplied by C the terms in x and x- agree exactly 
with those in the expression for y. 'Fhe constant term is different; 
thus 




+ C 


A 

C 


On bringing the last two fractions to a common denominator 4C® (by 
multiplyiftg numerator and deitominator bv 4C) this becomes 
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It is convenient to write a for — BjzC, and ^ for (B* — 4^C)/4C^ so 
that this becomes 

y ^ C{X - ay -C^ 

= C[{x-ay-^] . . . (3-9) 


Note on notation. The sloping stroke or “solidus” provides a 
useful way of writing a fraction, e.g. — BjzC instead of and 

B ^AC device has of course 


(B^ — 4/IC)/4C^ instead of 


4C^ 


been used several times already. But it is necessary to be clear how 
it is used in more complicated expressions. The usual convention, 
followed in this book, is that the denominator consists of all symbols 
to the right of the solidus as far as (but not including) the next + 
or — sign. (The same rule holds for the multiplication signs X 
and . as well as for the division signs and :). Thus <3/26 + c means 


a 

2b 


+ c, and not 


a 

zb c' 


the latter will be written ^2/(26 + c). 


Now (x — ay is a perfect square, and is therefore positive except 
when X ~ a, when it is zero. Thus C (.v — a)- is positive when C is 
positive, and negative when C is negative, but zero when x = a. It 
follows that if C is positive >» = C (.v — always greater 

than — except when x = a, so that y has the minimum value — 
and takes this minimum when x — a. Similarly if C is negative3^ takes 
the maximum value —Cfi when x = a. 


p R o B L E M s 

Find algebraically the maximum or minimum values of the following 
expressions, and the values of x at which they occur: (i) 3 x^, 

(2) 18 — 8x ^ 2x% (3) 45 - 20X -t 4x^ (4) 29-43^' - 4 - 905 ^'‘* 

The expression y - C (x — a)* — Cfi also shows us what the effect 
is on the graph of altering a, ^ and C. If we alter Cp we merely add a 
certain quantity on to y, i.e. we move the whole graph bodily upwards 
or downwards. If we change a we change the position of the maximum 
or minimum x = a, and simply shift the graph horizontally. On the 
other hand if we change C we change the shape of the graph: the 
greater C is, positively or negatively, the more rapidly the graph curves 
away from its maximum or minimum, and the “sharper” it is at its 
tip. 
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This trick of completing the square also enables us to find the value 
of X when we are given the value of y. For to solve the equation 

y^C{x — a)2 — Cys 

we rewrite it as 

C{x-aY^y + CyS, 

or, after division by C, 

(^x-a.Y=ylC 

There are now three cases to consider. If {yjC -1- y 3 ) is negative this 
is impossible, as (x — a)^ cannot be negative, so that the graph cannot 
take this value of y anywhere. This will happen if y is greater than 
the maximum value, or less than the minimum. If {yjC + y 9 ) is 
positive then (a: — a)^ = {y(C + y 3 ) if (.v — a) is either the positive or 
negative square root of (jy/C + / 3 ); i.e. (a: — a) = i iS). (The 

mathematical sign -y/u is usually interpreted to mean xh.Q positive square 
root, the negative one being denoted by — \/u, e.g. \/4 = 2 , but both 
2^ and (— 2.Y are equal to 4.) This gives us two possible values of a:, 
say x' = a -\- -\/ (y/C 4- yS), and x" — a — V (y/C + y 3 ). In terms of 
the original expression y — A -i- Bx + C.v- we have already shown that 
a — —^BjC and ^ = (B® — 4^C)/4C®, so that the values x\ x" can 
be expressed as [—B V (4jyC' + —4_/4C)]/2C, a:' being the 

greater of these two quantities and x" the smaller. This case corres¬ 
ponds to the values of y which arc less than the maximum or greater 
than the minimum. For example, if we throw a ball in the air we shall 
expect it to have a given height y at two distinct times a:, once when it 
is rising and once when it is falling. If y — o we have the standard 
formula for the solution of a quadratic equation A + Bx 4- Ca* = o, 
viz. A- = [ -B \ '{B- ~ 4ylC)]/2C. 

The last case occurs when (y/C 4 - ^) = o, i.e. at a maximum or 
minimum point. 'I'he equation (.v — a)- = (y/C 4- = o then has 

only one ^oIntioIl, .v - a = — jtBjC. 'I'his is in fact covered by our 
genera! fr.rmnla v -- [~B 4 : v'( 4 y(^' ~ 4-4C)]/2C; it is the 

special case wlien the square root is zero. If we throw a ball in the air 
it will be al its liigbest point Tor only one instant of time. 


PROBLEMS 



Find the values of .v 
■ 18 — Sx 4- 2.V-, (7) y 


for which y = 40 when (5) y = 3 4- .v-, (6) 
= 45 — 20.V -r 4.V-, (S) y 2g-43.v — 4-905.r2. 


1 here is one linther trick wc can perform on the expression 

have put it in the form C[(.v-'-a )2 — / 3 ]. 
It is positive or zero, i.e. it B- — 4.4C is positive or zero, we can 
break the expression up into factors. For [(.v — a)'^ — can then be 
written as [(.y - a)- - ( and so from the formula u-—v-^ 
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{u+v){u—v) we see that [(x — a)-—^]={x—a-\-\/p){x —a —V^)> or 
y =■ C[x — (a — Vi^)] [x — (a + \/^)]- If ^ is negative this process 
fails, and in fact there are no such factors, as we can readily show by 
a reductio ad absurdum argument. For suppose, if possible, that y had 
a factor {x — h): then we can write 

3, = C[{x - a.f -^■\ = {x^ h)Q 

where Q stands for the other factor or factors. In this equation put x = h; 
we then see that C[{h — a)* — jS] = o or since C is supposed not to 
be zero, {h — a)^ — ft = o, so that {h — a)'^ = ft. But since ft is 
negative this equation cannot be true, so that our assumption that there 
is a factor (jc — //) is untenable. 



Fig. 3.5—The relation between height and weight for London schoolgirls 
Continuous line : y 12 4.V* 

Dotted line: y -6702 i 2136X — 24«;2a-* ! i 2HoAr* — 238x* 


PROBLEMS 

Find the factors, if any, of (9) — 4, (10) + S-’'" + ^+ 3, 

(12) 18 — Sx 2x^ (13) 45 - 20X f 4.v^ (14) 29-43;^ — 4-905JC*. 


3«5 Cubic polynomials 

Polynomials containing .v‘, known as “third degree” or “cubic” 
expressions, may also occur. For example, the mass of a cube of 
density p and side will be px-^. In general if we have a number of 
objects of similar shape and made of the same material, then the mass 
y will be proportional to the cube of the length a:, i.c. y = Dx^. It is of 
interest to ask how far this relation holds among human beings. 
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Fig. 3.5 shows the average weight (in kilograms) of a sample of 
London schoolgirls plotted against their height in metres (L.C.C. 
report on the heights and weights of school pupils in the County of London 
in 1949). More precisely the children have been divided into groups, 
of heights from -995 to 1045, 1045 to 1095, and so on, and the weights 
of all children in each group averaged and plotted against the central 
height of the group, i.e. i-02, 107, etc. This grouping will slightly 
distort the curve, but that does not greatly matter in the present con¬ 
nection. The continuous line is the curve y = I2*4a:% and it will be 
seen that it fits the data well above a height of i -3 metres, but is rather 



Fig. 3.6—Graphs of tlic '..abic functions 
(i) y 3 - X i (iil) y ^ 

(ib y 7 r 3 x — (iv) y == —3 4 4X — + x" 


low below that height. Some such effect might indeed be expected, 
since the shape and relative proportions of liones, muscles, and other 
constituents change during growth. \\’^e shall explain later, using the 
theory of statistics, how a more accurate fit can be obtained; here 
VC merely demonstrate that the dotted line, which represents the 
fourth degree polynomial, y — 670-2 -j- 2136.V — 2492.V2 + 1280.V® 
- - 23S.v‘ gives a much better approximation. 

Tiiere arc three morals which we can draw from this. The first 
IS the danger of extrapolation. If we had onlv the portion of the cur\'e 
above 1-3 metres, we might suppo.se that'height and weight were 
always related by the equation y — i2-4.y^, and e.xtend this curve to 
smaller values of .v. We should then get wrong results. The second 
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moral is that if we choose to take a sufficiently complicated expression, 
we can get a good ftt to any smooth graph. Thirdly, we notice that 
while the expression —670-2 + 2136.V — 2492.V- + i28o.v^ — 238^:* 
represents the weight pretty accurately, at least in our range of heights, 
it has no very simple or obvious interpretation, unlike the less accurate 
relation y = i2'4a-^. Thus, although purely mathematical accuracy 
undoubtedly has its uses, it must be sought after with discretion, as 
otherwise it may sometimes obscure the real issues. 

The general form of a cubic polynomial will be y ■ - A + Bx -f 
Cx^ + Dx^. We show the graphs of certain cubics in Fig. 3.6. They 
are all sinuous curves with two bends, rather like the letter S placed 
on its side. Some of them rise steadily throughout, likejv = 2 + .v + x^, 
others rise up to a point, then fall for a time, and then rise again, like 
y = —3 -f 4JC — + x^. Some fall throughout, and others fall, 

rise, and fall. Since for large positive or negative values of .v the 
quantity x^ is much larger in magnitude than x or x-, we can say that 
for large x the curve y = A f Bx -f C.v^ + Dx^ will behave in a 
similar way to y = Dx^. If D is positive, then y will be negative for 
large negative values of .v, and positive for large positive .v. If D is 
negative the reverse will be true. Thus, unlike a quadratic, a cubic 
can never have an absolute maximum or minimum. It can, however, 


have peaks and valleys such that the value of y is greater at the peak 
and smaller at the bottom of the valley than it is for any nearby value 
of X. For example, y = —3 4- 4X — 4.v''= + -v'* has a peak when x = 
y = and a valley at x = 2, jy ^ —3. Such points are called local 

maxima and minima. 

Such local maxima are not to be despised; they may for example 
have some significance in the theory of evolution. A species may he 
better adapted to its environment in its natural cottdition than if any 
small change was made. A large change might on the other hand be 
of benefit to the species, but it may not be able to make such a change 
without going tlirough the disadvantageous intermediate conditions: 
and so it will remain at its local maxinium of adajitalion. A simple 
example is that of the rabbit: this is well adapted to life in Britain, 
and even better adapted to regions of Australia where there is a plentiful 
supply of food and few enemies. 4 ’his degree of adaptation could be 
measured in various ways the simplest being the density, or numl^er 
of rabbits per square kilometre, which results or w<juld result from 
colonization by ral)bits. '^Fhus tltis adajitatiotj has local maxima in 
Britain and Australia. 4 ”he species also inhabits Ivurope and Asia, 
with doubtless various points of maximum and ininimun^ success. 
But the Promised Land of Australia is separated from other colonized 


countries by sea, where, o( course, the adaj>tati<>n is zero, and which 
presents an impassable barrier to the species. Only by the intervention 
of man has Australia been colonized, and it presumably would have 


remained free of rabbits had man 


not interfered: i.e. the species would 
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have been confined to those regions to which it was locally rather than 
absolutely best fitted. A similar example, in which the variation is 
genetic rather than geographical, occurs in the dachshund breed of 
dogs with long jaws. The dachshund type is well adapted, if only 
because its odd shape arouses curiosity and interest among human 
beings and keeps the breed going. The normal types of dog also 
prosper, partly for similar reasons. But it seems that the lower and 
upper jaws are lengthened by different independent genes, so that an 
intermediate breed in which some but not all the dogs had long jaws 
would be sure to contain some dogs who had the genes to lengthen 
the lower jaw but not the upper one, and vice versa, and such dogs 
would be in a very sad state. Thus both the normal and dachshund 
types represent local maxima of adaptation, with a minimum some¬ 
where between. In view of this it is rather a puzzle to know how the 
dachshund type ever arose in the first place: it would appear to require 
a very happy accident for the mutations necessary for the lengthening 
of both upper and lower jaws to have appeared simultaneously. 

There is no simple device like that of completing the square which 
can be used to find the position of the maximum and minimum of a 
cubic curve, the factors, or the value of x for a given value o( y: so that 
discussion of these points will have to be deferred until Chapter 12 
where we consider more powerful methods of tackling these problems. 
We shall, however, note that for certain values of y the equation 
y = A Bx + Cat- may be satisfied for three distinct values 

of .v; that is, the graph may attain the same height y at three distinct 
points. This will be seen from Fig. 3.6 to be true of the curve y == 
7+3-^ — A'® for the value y ^ 7. In fact, in this case it is easy to 
determine the particular values of .v, for if 7 H- 3.V — .v® = 7, then 
2X — x^ — o, or in factors .r(3 — = a'(-\/3 — ^)(V3 + 

One of these factors must be zero, i.e. either x — o, or x = \/ 3 > 

A = — a/3- 

From cubic curves we may go on to fourth degree and even higher 
ones, 'rhcse can be of even more complicated shape: for example, 
y = 2.V- — .V* has two peaks and a valley between. It also becomes 
correspondingly more difficult to investigate their properties in detail, 
though we shall develop certain powerful methods later. 

3.6 Evaluation of a polynomial 

Sometimes we want to calculate tlie value of a polynomial for a 
particular value of x; for example, we might want to know the value of 
2 -f- i'7.v — I-6.V- -j- 2.v^ when .v — 1*3. We can, of course, work out 
the values of v- and .v^, and then find 2 -\- i-yx — i-6.v- -}- 2.v® by 
direct multiplication. But this is a rather laborious process. It is 
much better to write the polynomial in the form 

y - 2 4 - A' [1-7 4- A- ( — 1-6 d- 2.v)] 
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We then say that if a: = 1*3 

— 1-6 + 2.V = —1-6 + 2-6 = I 

1-7 — i -6a' + lx- = 17 + ( — 1-6 -f- 2x) 

= 1-7 + 1-3 X I = 3 

jy = 2 4 - l yx — I-6.1:* 4- 2x^ 

= 2 4-.x’(i-7 — I 6 x 4- 2x~) 

= 2 4 - 1-3 X 3 = 5 9 

In general, suppose that we wish to evaluate a polynomial y — 
A 4 ~ Bx 4- Cx^ 4 - Dx^ = A + ;c[£ 4 --^(C' + xU)\ for a particular value 
of X, say X = k. Then we calculate in turn the following quantities: 

c = D 

b = C ^ kc[= C + kD] 
a = B -\-kb[= B -{-kC kW] 

Y = A + ka[= A + kB k^C + k^D] . (3.10) 

Y is then the value of y when x is equal to k. We have illustrated the 
process on a cubic polynomial, but obviously it can be applied to a 
polynomial of any degree. If we wanted to evaluate A + Bx 4- Ca.*^ 4 - 
Dx^ 4- Ex^^ 4- Fx^ for a particular value of x% say x = k, we should 
calculate in turn e = F, d = E ke, c = Z> 4 - ^id, and so on, ending 
with Y = A + ka. 

3.7 Functions 

At this point it is convenient to introduce a new name and notation, 
which we shall frequently use later on. If .v and y are two variable 
quantities such that the value of y depends on the value of x, then y 
is said to be 2 function of x. Thus the yield y of wheat from a given 
field depends on the amount of rainfall x, the weight of an animal 
depends on the amount of food it eats, and the speed of a chemical 
reaction depends on the temperature. The cases with which we are 
mostly concerned will be those in which y is a function of .v only, 
and not of any other variables—or at any rate those cases in which 
the effect of any other variable quantities can be neglected for the 
purposes of the problem in hand. For example, the volume of a cube, 
y, depends only on the length of its side, .v, and on nothing else: given 
X we know the value of y exactly. If the temperature is kept fixed, 
then the osmotic pressure of a solution depends only on its concen¬ 
tration, and not on the shape of the vessel containing it, or any other 
such factor. 'Fhe osmotic pressure, we say, is a function of the con¬ 
centration. If we study one particular animal, then its weight y is a 
furiction of its age x, for if we specify any particular age then the 
weight is also determined. Henceforth, whenever we use the word 
“ function ” without qualification we shall mean a relationship of this 
type; that is to say, whenever we know the value of x we can find y. 
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This of course is a very different meaning of the word “function” 
from its use in physiology. 

Many functional relationships can be represented by mathematical 
formulas. If x represents the side of a cube, and y its volume, then 
y = a:®. If .V represents the concentration of a solution, and y its 
osmotic pressure, then for sufficiently small concentrations y is 
proportional to .v, so that y = Bx where B is constant. If y = ^x 

then y is a function of x. This is also true if jy is a polynomial 
in Xj such as jy = 2 15A: + 3^:®. 

It is customary to indicate that 3? is a function of x by writing 
“y = f (jc)”. [This is read as “y equals/jc” or “jy equals f of :c”.] The 
statement ""y = f {x)'* means therefore that the value of y depends on 
that of Xy but that we are not concerned with writing out in detail the 
exact relationship: we represent it simply by the letter/. Thus instead 
of writing out a long expression such asjy = [i + 3A? -h ^ 

[2 X- 5.v^] everj' time it occurs, we might write it simply as 
y — /{x). Another relationship, such as y — could 

be written y — F(x)y and still further functions as = ^{x), or y = <f>{x) 
and so on. 


Any function y — F{x)y whether it can be represented by a mathe¬ 
matical formula or not, can be put in the form of a graph by plotting 
the value of y for each value of x. 

'I’he notation F{x) has an additional convenience. Consider the 
equation y - F{x) = 29*43.v — 4-905.v2, which represents the height y 
at time .v of a cricket ball thrown vertically upwards at a speed of 
29-43 nietres/sec. Then we can represent the height at .v = o seconds 
by F{o)’y to find it we have simply to replace x by o in the expression 
for T(-v), finding J^(o) = 29-43 X o — 4-905 X o® = o. Similarly F(i) 
will stand for the height after i second, and W'ill be 29-43 ^ ^ — 
4-905 X I- “ 24-525. F{2) will stand for the height after 2 seconds, 
and F(k) = 29-43/2 — 4-905/2- the height after k seconds. In just the 
same way T(2.v) will stand for the height at time 2.v, and will be 
F{2x)= 29-43(2.v) — 4-905(2.v)=; F{x + i), the height after {x + i) 
seconds, and the h- /ht after x:- seconds, will be F(.v -H i) == 

29-43(.v -i- 1) — ^■90s{x 4 - . and F{z~) ^ 29-43(^2) ^ 4-905(j2r=)= res¬ 
pectively. 

'1 he above notation y F{x) is the standard and customary one, 
and will be found in most books. But it seems worth noting that it 
can be slightly simplified. In “/'’(.v)” the left-hand bracket always 
sticks to ib.e letter /• closer than a brother, whatever we may put inside 
the bracket, e.g. k{a-b~c-). It therefore does no work which cannot 
equally well be dt)ne by the letter F itself, and could easily and con¬ 
veniently be dropped as superfluous, "^bhus it would be perfectly 
possible to write “y - T.v)” for “y is the function F of .v”; or, even 
more simply, we may on many occasions reduce the bracket to a mere 
point, writing “y =--- F.v.”. Similarly, Fz) or Fz. will stand for whal is 



SOME POINTS IN ALGEBRA 


47 


customarily written as F{z), and Fx^) or Fx^. for the function of x^. 
This slight simplification enables us to dispense with not only one 
bracket, but often with several brackets at a time, making the formulas 
considerably more legible. Thus just as the square of y is written 
so the square of Fx) can be written Fx)-, whereas in the conventional 


Fig. 3.7—The relation z ^ xM y’ represented by a surface 



noution it has to be written [/'(Je)]^ involving three more brackets. 
As the notation Fx) or Fx. is rather unusual in appearance we shall 
keep to the standard usage F{x) in this book: but we invite readers to 
give serious consideration to the merits of ''Fx.". 


We can also have functions of two or more variables. The phrase 
'z is a function of x zudy", or in symbols, ''z^F{x,y)" or "z=Fx,y)” 
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means that when we know the values of x and y we can find that of z. 
Thus the distance to which a cricket ball is thrown is a function of 
both the velocity and inclination of the throw; if projected either 
horizontally or vertically it will not go very far. The volume of a given 
quantity of gas is a function of both pressure and temperature. If 
z — + :^ + y^y then ^ is a function of x and y. 

It is harder to represent such a two-variable function graphically. 
If z = F{Xy y)y then ideally we can represent x and y by two distances 



Fig. 3.8—Vertical sectiofis of the surface z =» x- + y- 


or “co-ordinates” measured in perpendicular directions in a horizontal 
plane and xr as the height above this plane (Fig. 3.7). The values of z 
for varying .v and y will then form some kind of surface, which will 
be a tlirec-dimcnsional representation of the function F. Unfortunately 
such a surface is difficult to construct and cumbersome to use, and it 
is usual to draw instead on a single diagram a number of plane sections 
of this surface. One way is to select a certain number of values of y, 
and for each of these values draw the graph of ar against .v (Fig. 3.8). 
This corresponds to taking a number of parallel vertical sections 
through the surface. Another method is to take horizontal sections, 
plotting the curves in the horizontal plane at which xr has a given value 
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3 - 9 )* This is precisely what we do when we plot the contour 
lines of a mountain. Unfortunately even this device fails for a function 
of three variables, such as w = F(x, y, xr). Considerable ingenuity may 
be needed to provide a really suitable diagrammatic representation of 
such a function. 



Fig. 3.9—Contours of the surface z x* I- y* 


PROBLEMS 

Sketch the form of the surface, sections, and contour lines for the 
functions (1) ^ .v -f- 2y, (2) z - x — y, (3) c — .v- -- (4) z = xy, 

{5) ^ = X + i/x — y. 

3.8 Interpolation 

If>' = F(x) is any given polynomial in .v, the procedure of Section 3.6 
enables us to find the value Y = l'\h) of this polynomial for the par¬ 
ticular value k of X. Hut quite often we may wish to reverse this pro¬ 
cedure. Suppose that we know the values y, - ^(v,), y^ — F{x^, 
Vi = F{x-^, and y ., = F{x i) of a polynomial for 4 particular values of 
namely x,, x.^, x-j, and a:,. Can we find the general expression 
y ^ A Bx 4 - Cx'^ -f- . . . for the unknown polynomial F{x)} 

Here a word in explanation of the notation a:,, .Vg . . . etc., may 
perhaps be helpful. In order to avoid the use of a large number of 
different letters of the alphabet it is often convenient to form new 
symbols by adding subscripts, such as .v,, y^, or primes, such as x\ 
y » to letters. 'I'his is very similar tf) the custom we have in human 

B 
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families of using a single surname together with added Christian 
names: “Henry Jones”, “John Jones”, “Emily Jones”, etc. Each of 
these is really a single name standing for a single individual, but the 
common surname reminds us of the family relationship. The same 
applies to zoological and botanical classification, as in Gallus domesttctiSy 
the farmyard fowl, Gallus gaUiiSy the eastern wild species, etc. Here 
*ri, »2. ^3. am are each in effect single symbols, representing four 
arbitrary numbers : but writing the relationship as jVi — reminds 

us that yi is the particular value of the function y = -F(a*) when the 
variable x takes the particular value ari. We could write the relationship 
as y = F{k)y but that would be less suggestive. It is usual to read 
“a*,”, “ao”, etc., as “.v-one”, “ A-two”, . . this is quite clear 
because the product x times 2 is always conventionally written as 2.v, 
with the number first, and not as a 2. If, however, the subscript is a 
letter, as in x^, although this may still be read as “.v, r”, it seems pre¬ 
ferable to say **x subscript r” or “a sub r” to avoid confusion with the 
product xr. 

This process of fitting a polynomial to certain points on a graph is 
very important because most smooth graphs can be approximately 
represented by polynomials over a limited range. Thus we have already 
seen that the relationship of height of father .v to average height of 
son y is represented closely by -{- 34’8 (units in inches), Avhile 

the relationship between height .v (in metres) and average weight y 
(in kilograms) of London schoolgirls is given over a certain range by 
y ~ I2'4A^, and over a wider range by a fourth-degree polynomial. 
The difficulty with the fitting in these cases is that the values of y are 
subject to a certain amount of chance experimental error, and we 
cannot discuss the most efficient methods of fitting the polynomial 
until we have developed a theory of experimental error (Section 21.10). 
Here wc shall confine ourselves to the simpler problem of finding a 
polynomial y = F{x) which will exactly relate a set of points (.Vj, j>'i), 
0^2, y^)} he. such that y^ — -^(a'i) exactly, y™ F{x2)^ and so on. 

The process of fitting goes on as follows. Suppose that we want 
the graph to pass through the four points (.v,, ji), (.Y2, ya), (A3, yg), 
(aM» >’4)* We shall try an equation of the form 


y -- /'Xa) = a + ^ (a — .Vj) -f y (x — a,)(a — X.,) 

-f- S (a — .y,)(a ~ am)(a — .V3) 


( 3 ”) 


where a, y, and 8 are four constants whose value we wish to deter¬ 
mine. Clearly, since .v,, am, am, and .v, iu-e also constants, if wc multiply 
out the right-hand side of (3.11) wc can if we wish express it in the 
form A 4- Bx -f- Ca- 4 - Dx'K (If there arc ?i points to be fitted, the 
formula ( 3 *ti) will be carried on to n terms, i.e. the polynomial will be 
of the (« — i)th degree.) 

Now by hypothesis, y 1 /'(a’i); substitution in (3.11) gives at once 

yi a, since all other terms have a factor (a, -- .v,) and so are zero. 
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This determines a. Now substitute jy 2 = 2) J thatgivesj2= « X 

{Xi — Xj), and since we know a = we can determine ^ at once from 
this equation. [In fact, )3 = {y^ — yi)/(^2 ~ •'^1) ] Substituting now 
y^ = F{xj^ we find jyj = a ^ (.V3 — ^1) + y (^^3 — x^){x 3 — ^2). 
since a and jS are known we can solve this equation for y. Finally the 
substitution 4 = ■^(•^4) gives an equation for S. 


(i) Find 


What is 

Write y = a -y px + yx{x — 0 + ~ 3 )- 

The substitution x = o, y — i, gives a = i. 

X = I, y = 20, gives a + /3 = 20, ^ = 19. 

JC = 3, y = 118, gives a + 3/3 + 6y = 118, y = 10. 

JC = 4, y = 209, gives a +4^ + I2y + 128 = 209, 8=1. 

Hence= F{x) = i + 19A: + io.v(ar — i) + x (a: — i)(.v — 3). By 
substitution of x = 2 we obtain F{2) = 57* However, the calculation 
of an expression of the form 

a + ^ (x — X,) + y (jf — ^l)(^ — X2) + 8 (x — x^){x — XaXx — X3) 

= a + (x — X,) [^ + (x — X2) ;y + (x — X3)s; j 

can be done quite simply by the same sort of trick as we used to evaluate 
A Bx Cx^ + Dx^ ^ A + X [/^ + X ;C + xD| ], viz. we calculate 

in turn 

^' = y + (X — X3)8 
a' = ^ + (X — Xg)^' 

3/ = tt + (x - Xi)a' 

Here, taking x = 2, x, = o, Xg = i, x.., = 3, we have 

/5'= 10 — I X I = 9, a' - 19 + 1x9= 28, 

■y = I + 2 X 28 = 57. 

(2) Find ■\/200 using a table of squares. 

Here we try to find a polynomial which is approximately equal to 
^/x. Although y/x cannot be exactly expressed in such a form, it can 
be closely approximated over a limited range of values of x. Now +200 
is not far from 14, and the relation y ^ \/x can be written as x = y^. 
Let us choose four values of y near 14 and square them. Wc find 


EXAMPLES 


a polynomial y — F{x) to fit the values 


X = 0 

I 

3 

4 

y = I 

20 

118 

209 


F(2)? 
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140^ = 19600, 141® = 198*81, 14*2® = 201*64, 14-3’^ = 204*49. 
nearest of these to 200 is 198*81, the next nearest 201*64, next 
196*00, and the last 204*49. We shall therefore write for the four points 
we have on the graph of y = 

= 198*81 X2 = 201*64 X2 = 196*00 ^^4 = 204*49 

yi= y2= 14-2 y 3 = h*o y*= h *3 

Now we shall calculate the polynomial fitting these four points— 

j = a -f- (.r — 198*81) + y (x — i98*8i)(:c — 201*64) 

-j- S (a; — 198*8i)(a: — 20i*64)(Ar — 196*00). 

Substitution of the four values in turn gives 0=14*1, /8= *035335689, 
y s= —0000445923, S= *00000011019, whence putting x= 200 we 
find V200 = 14*14213564, as against the true value 1414213562—an 
error of only 2 in the 8th decimal place. (The correctness of the cal¬ 
culation can be checked by substitution of Xj, .Vg, x^ and .r4 in the 
formula.) This method has the virtue that we can calculate the terms 
successively, stopping when we have achieved sufficient accuracy. Thus 
if we only use two points (atj, y^) and (.Vg, y^ we can calculate a and 
and obtain \/zoo = 14*14205, which is correct to 5 figures. This is 
shown by bringing in (.Vg, jg) to calculate y: the additional term turns 
out to be less than *0001 and gives V^oo = 14*1421365, which is 
correct almost to 8 figures. Finally if we bring in (^^4, y^) we find ■y/zoo 
almost to lo-figure accuracy. 

For many purposes indeed it will be quite sufficient to take only 
the a and fi terms, provided that .v, and .Vg were chosen sufficiently 
near the value of .v for which we want to calculate y. On substituting * 
in the values of a and we have already obtained, we find the con¬ 
venient formula 


y ^ a fi {x ~ .Vi) 

- [>’i (-Va — a:) -hyz (.V — .ri)]/(.V2 — x^) 


( 3 * 12 ) 


This can be written y = (a — /S.Vi) -j- which is of the form 
A Bx, and therefor:? has a straight-line graph. So the use of the 
a and ^ terms aUcie >:> justified it the graph is practically straight 
bet\^■een the two poi'.ts f.Vi, 3',) (.v„, j’^); the formula is accordingly 
called “ linear iiuerpoiation 

This proc .ss « i*itvrpolation is of wi<,lo use in reading mathematical 
tables. It crv.hlt s us tt) liiid the vaiuo of a function [let us sayy = -F(a?)] 
lor values of .v b( iweeii t he tahul ited values. That enables us to shorten 
tl'.e tahie, \s‘hich won’d be very bulky if e^■er^' possible value of .x* and 3.' 
which were likely to be usta had to be separately tabulated. Generally 
speaking, in ^ood :no».:im tables an effort is made to reduce the inter¬ 
polation to the simple linear formula (3.12) wherever possible; but with 
uncomiiKni functuins which nuiy not often be required this may not 
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be economically practicable, and it may be necessary to use the y, S, 
or even higher terms (see also Section 5.8). 

In most tables the values of x are equidisiantly spaced, or form a 
so-called “arithmetic progression”. In that case the formulas for 
interpolation, though based on the same principles as for unequally 
spaced values of x, are somewhat simplified. Thus a four-figure table 
of sines gives sin 45*0® = -7071, sin 45-1'^ = *7083, sin 45-2® = *7096, 
etc., proceeding at every *1°. In this case we shall call the equidistant 
values of « “x.*”, “Xq”, “x,”, “xj” . • - and the corresponding 

values oiy, “y-a”, ‘>-1”. ‘>1”. “>'2” , the numbering being 

arranged in such a way that the value of x we are interested in lies 
between x^ and x,. We shall also introduce the number n, which is 
the fraction of the interval of tabulation by which x exceeds Xo, i.e. 
n = {x ~ Xo)l{Xi — Xq). If, for example, we were interested in 
x = 45-14^, then Xo = 45 i'’, = 45'2^ are the nearest tabulated 

values, the interval of tabulation would be -i®, and n = *4. 

The formula for linear interpolation then becomes 


y = Xo + « (x, — Xo) 

= (i — «) Xo + wx, 


( 3 - 13 ) 


the first form being most suitable for manual computation, and the 
second for a calculating machine. Thus 

sin 45-14' = -7083 + '4 (-7096 — -7083) 

^ = *7088 

The next formula is that for quadratic interpolation (i.e. by the 
use of a quadratic function). 'J’he simplest and most accurate form is 

>' = >'0 +«(>'! -yo) i .l«(i ")(->' ! T Jo +Ji -J 2 ) (3-»4) 


Suppose, for example, that we wish to calculate tan 66-14° from a 
5-figure table of tangents. We have 


x_j= 660'^ 

Xo 66-1 ^ 

.V. = 66-2^ 

Xg = 66 3 ■ 


tan 6 (yo^ = 2-24604 

Jo = 66-1' = 2-25663 

y^ = tan 66 2'" = 2-26730 

j2 = tan 66 3° = 2-27806 


whence jy, —y^ — -01067, —y-i Jo + Ji “ J2 “ - -00017, and y = 
tan 66-14"' = 2 25663 -i- (•4 )(-oio67) + f (•4)( b)(—00017) = 2 25663 
4- -00427 — -ooooi • 2-26089. Here it will be seen that the correction 
for the quadratic term is very small, amounting to only -i in the 
fifth place of decimals. 'I hc correction can be negl<;cted, and formula 
(3.12) used, if the quantity {—y-t + Jo T>’i —Ja) not exceed 

in magnitude 16 times the maximum permissible error in the result; 
e.g. if we wish to get the result correct to 5 decimal places, (- -\- 

Jo 4 'Ji Jz) must not exceed 16 / 000005 • -00008. 
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The general method, applicable in any circumstances, is as follows: 
we set out a “table of differences”— 


y-2 

y-i 

yo 

yi 

y 2 

V3 

y* 



^-S/2 

^-ll 8 

•^1/2 

^ 3/2 

^ 5/2 

^ 7/2 



^-1/2 

^0 



^1/ 2 


^1/2 

/i 

^3/2 

^2 

^3/ 2 


^5/2 





'The first column consists simply of the tabulated values In the 
second column the xr’s are found by reversing the signs of alternate y’s; 
those with even subscripts are left unaltered, those with odd subscripts 
have the sign changed. 'Fhe third column is found by adding adjacent 
z's; thus -f- (and is very naturally written half-way 

between Zq and ^^3/2 -^i *7“ ~2> ^s/2 •^2 T'he next 

column is found by adding adjacent «’s; etc.; the 

cs are found by adding adjacent Z>’s, r,/2 = ^0 + and so on. (This 
process is excellently adapted to Colson notation; see Chapter 22.) 
d he general forninla for interpolation in its most convenient form 
(1-seritt‘s formula) is then 


V—v„—I ! IiJ>j -! -f-Fp/o-j . . . ( 3 . 15 ) 

where the Ih- eriit coeflicieiits /i,., /ij, /) „, Fq, are calculated as 
loi;.»\vs. Let m ■ i - ri. d’lien 


- 



w-) (2 

3 )> 

/>o 

?// t 


W-){ 2 r - 

• m-) i 2 



r- - 

-- 


and 





F, 

^ - n 

■ 

- (2 

3). 

/>, 

- ii 

(i- 

r.)\2~ 

;/-) ^ 

(■\ 

;/ 

(\~ 

;L)( 2 " 

‘>X / -> 

/i-j( 3 - 

The 

lirsi 

0 

terms r 

— * 0 


• in 

' I !• 

li ami d 

, arc less 


4 •: 5). 


■ 5 >- 4 X 5). 

n-) (2 :• 3 X 4 X 5 ;< 6 X 7), etc. 


aiui subsequent terms, 


pci .. w>s.ble error, we can omit tlie ,,d „ 

scoppin^r h J) and /, arc less than 200 times the per- 

mi-.-il>lc error, wc can sinuiarly neglect them, and similarly if//p and 
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hi are less than 800 times the error allowed they are negligible. If 
is less than 120 times the allowable error then we are generally 
safe in using formula {3.14)- 

The above formula is the most general one: it is given here for 
reference. But for the few cases in which (3.14) fails the following 
remarkable formula due to Comrie will almost certainly be found 


adequate— 

y =p. my a -j- ny, + Im (i — ni^) {b^ + -184 

—ln{i —n-){bx 4 --184^1) 


(3-i6) 


where m = \ — n. This is applicable provided that and d, do not 
exceed 2000 times the maximum permissible error. Further explana¬ 
tion will be found in the booklet, Interpolation and Allied Tables (pub¬ 
lished by H.M. Stationery Office). 

Sometimes a table gives the value of a function of two or more 
variables, say z = F(x', y), for equidistant values Xq, .Vj, x^ ... of x 
and for equidistant values y 0. >'i» 3'2 • • • y- Such a table is known 
as one of ‘‘double entry’\ If we wish to calculate for values of 

X and y between the tabulated values, the simplest procedure is as 
follows. First keeping y fixed at each of its values, y 0, yit yzt • ■ • in 
turn, we interpolate for the desired value of x, finding F{x^ y q), F(.v, yi), 
etc. Then keeping fixed at its chosen value we interpolate for y, find¬ 
ing F{x,y). In particular for linear interpolation in and y we have 

the formula 


2 = F(x,y) = (i — «)(i - Fl)F{x^,yo) + «(i “ A^)^(-v„yo) 

4 - (I -n)NF{x^,yy) ^ nN F{x,,yi) . ( 3 . 17 ) 

where n — {x — — ■*’«) = (y y o)/(yi y o)- 


3.9 Algebraic fractions 

Besides polynomials we may of course have many other types of 
dependence. Since algebraic symbols represent ordinary numbers, we 

can have fractions such as y = (x* 4' 0/( ^ 0» ^ ~ "b jV*'* 

so on. Thus when x = 2, (^ 4 - i)/(^ “ 0 “ 3 /^ = 3 - I he one 
Caution which mu^ be observed is that no fraction may have zero 
denominator: (x 4- j)/(^ — i) is not defined when x = i. 


EXAMPLES 

(i) Boyle’s law: for a gas, if ;c = the volume and p = the pressure 
at a fixed temperature, then xp constant = K. (say), or p iCjx. I he 
graph p = ijx is shown in Fig. 3.10; the curve is known as a “rectan¬ 
gular hyperbola”. It will be observed that as x approaches o, p increases 
in magnitude without any limit, or “bound” (to use the proper technical 
phrase). Thus when x -oi, p = 100, when x = 0001, p = 10,000 
and when x = -oooooi, p -■ 1,000,000. If x 001, p = 1000, and 
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SO on. We say that^ becomes “infinity’* (symbol 00) when x becomes 
zero: but that is merely to be taken as a shorthand way of saying that 
as X nears o, p becomes large, as large as we like. The interpretation 
of this would be that by sufficiently increasing the pressure on a given 
mass of gas we could make it contract to as small a volume as we liked. 



Fig* 3*io^Graphs of the rational functions 
(i) p =* I X 

(ii) p = (x- + I) (X — i) 

(iii) p (x^ — I) (x= -r i) 

At the other end of the scale if .v becomes large, p becomes verj’’ small: 
this is in sharp contrast t() the behaviour ol a polynomial function, which 
must always become large for large .v. 

\^ e sec also that Boyle s law cannot hold exactly for very great 
pressures, for if it did then we could reduce the volume .v to as near 
zero as we wish, and that we know to be impossible. In fact, by suffi¬ 
cient compression the gas will be turned into a liquid—the change may 
be either discontinuous, accompanied by condensation, or else if the 
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temperature is above the critical point it will be continuous with no 
precise point dividing the gaseous and liquid states. 

(2) Baggally’s theory of psychological forces. According to a theory 
put forward by Baggally {Inter. J. Psycho-Anal, 28, 1947), if we have 
two pleasures of a similar kind, measured by quantities x and y, then 
they combine together to give a resultant pleasure /> = + y^)l{x -f- y). 

Unfortunately it is far from clear how we can measure the intensity of 
a pleasure, so that the quantities x and y must remain at present rather 
hypothetical. But we can investigate the consequences of the theory 
in a general way. 

Let us put y = —i, i e. a unit amount of distaste, and plot the 
graph of ^ = {x^ + i)/(^ — 0 (Fig- 3 -io)- When jc is just slightly 
greater than i, this becomes very large and positive, whereas when x 
is just below i, /> is large and negative: i.e. we should get the greatest 
amount of pleasure or unpleasure under the influence of two opposite 
desires which nearly balance. Now this does often seem to happen; 
it may be related to the fact that we get a great deal of satisfaction from 
solving a problem or confronting a situation which we can master, but 
not too easily, while the bitterest disappointments are those where we 
are just beaten. If the desires .v and y are of different kinds, then 
Baggally supposes the resultant pleasure to be given by /> = (jc® + y^) -f- 
(^^2 _j_ yiy Taking y = — i, /> = (^* — + 0 * we have also 

plotted the graph of this function in Fig. 3.10. This never becomes 
infinite for finite values of a: and y, which explains why we may never 
actually experience infinite pleasures or disappointments, as would 
apparently be indicated by the first formula. 

(3) The dissociation of water. In water the product of the con¬ 
centrations of hydrogen //+ and hydro.xyl OH- ions should be constant 
(and approximately equal to i/io^’). If therefore we plot one con¬ 
centration against the other we should obtain a rectangular hyperbola, 
exactly as for Boyle’s law. 

It is easy to predict the behaviour of a fractional expression for 
large values of the variable. Consider, for example, jy =5 (2-l- 3 -^ 

(1 -H 72Ar -h x^). When x is large, the numerator will be approximately 
5**, the other terms being negligible in comparison, and the denomina¬ 
tor will be approximately 'I'hus y will be approximately 5xVx®=5/x. 
For example, if .v = 100, y = -049943, against the approximation 
5/100 = -05. 

Algebraic fractions obey similar laws to those for ordinary numerical 
ones. Thus both numerator and denominator can be multiplied or 
divided by any non-zero expression: 

I _ 5 „_ 

X + i 5x + 5 (.V i)(x i 1) .V" — 


I 
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To add or subtract fractions, bring them to a common denominator. 
Thus suppose we wish to evaluate 

__ 5 _ 

(a; -t- i)* X — 2 x(x + i) 

The lowest common denominator can be written as 4- i)%x — 2), 
since it must contain (x + 1)2, (x — 2) and x(x -f- i) as factors. The 
given expression can therefore be written 

Sx(x — 2) + x{x + i)^ — 5(x + i)(.v — 2) _ + 10 

x{x + i)2(a: — 2) “ .v(.r + i)2(.v — 2) 

[Theoretically according to our above rule we might run into trouble 
if the common denominator becomes zero. Query', when does this 
happen, and what occurs then?] 

If we wish to solve an equation containing fractions it must be 
reduced to a common denominator: thus the equation 3/(0: + 1)2-)- 
i/(.v — 2) — s/.vC-^ + i) = o becomes (v® -p io)lx(x -h i)^(.v — 2) = o. 
Multiplying both sides of this equation by x{x + i) 2 (a: — 2), we get 
.v 3 4 - 10 = o, or .V = — -^10. 


3.10 Factors 

In operating with ordinary' numerical fractions we have seen that 
it is a great help if we can split the numbers up into factors. The same 
will be true for algebraic fractions, and there are a number of theorems 
which help us to do this. 

In Section 3.6 we explained a method of calculating the value 
1' of a polynomial y --- F{x) = A + Bx 4 - 4 - Dx^ for the 

particular value .v --- k. 1 his involved the calculation of certain num¬ 
bers c, b, a, by equations (3.9) which may alternatively be written 


c = D 
b — kc = C 
a — kb ~ B 
F{k) — ka A, 


and therefore we get by direct multiplication 

Fik) -'r {a bx cx~) {—k -j- .v) 

= F(k) — ak }- — kbx ^ bx- — kcx- 4 - cx^ 

A 4 Bx ^ C.v2 -f Dx^ ==r. F{x), 

that is 

F{x) -- ^ (x - k)Q(x) 4 - F{k) 
where 0(.v) - « 4 + f.v= . . (3.18) 

I his means that if we divide F(.v) = A 4 - Bx Ca *2 -f Ox^ by 

fe/.. T ^ « 4 ^ hx 4 - c.v 2 and the remainder 

/ (A), and that the actual arithmetic process for doing this division is 
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exactly the same as the process for finding F{k)=A-\~Bk-\-Ck'^. 
Thus, in dividing 3 + 2.v + a;® + by .v — 2. we find [using (3.9) with 
k = 2^ c — D = i,b = C +2c = 3,a = B+26 = 8, F{2) = a + za 
= 19, so that the quotient is Q{x) = a bx cx- = 8 + 3^? + and 
the remainder is 19. 

Theorem 3.1. A polynomial F(x) is exactly divisible by (x — k), that 
is to say, F(x) = (x — k) Q(x) for all x where Q(x) is a polynomial, if 
and only if F(k) = o. \The '^'Remainder Theorem .] 

Proof. If F{x) = {x — k) Q{x), then F(A) = {k - Q{k) = o. 
Conversely, if F{k) = o, then the above method of division gives no 
remainder, and shows us how to find the quotient 0{.v). 

This theorem often provides us with a useful test for finding simple 
factors. 

EXAMPLE 


(1) Solve the cubic equation F{x) = —2 — .v + 2x^ + x® — o. 

Putting X = I, it is clear that F{i) = o. Therefore (x — i) is a 
factor; by direct division F(x) = (x — i)(2 + 3 -'^ + ~ ° 

we must therefore either have x — i = o or 2 3.V -t- x* = o. If 

— I = o, X = I, while if 2 3X + = o, the usual formula for a 

quadratic equation shows that x = — 2 or i. Ihe complete solution 
is X = —2, — I, or I. 


Wc can also observe two simple properties of the quotient Q(x) 
which follow from the division process. 1 he first is that the quotient 
has degree one lower than the original polynomial. If F{x) is cubic, 
then quadratic; if F{x) is quadratic, then Q{x) is linear; if F(x) 

is linear, then Q{x) is a constant. 'I’he second property is that the 
coefficient of the highest power of x in Qix) is the same as the coefficient 
of the highest power in F{x). For example, if /’(x) A Jix + Cx^ 
Dx^, Q{x) ^ a -\-bx + cx2, then D -- c, by equation (3.9). 


Theorem 3.2. If the polynomial F(x) has degree n, and the equation 
F(x) = o has n roots kj, k.., . . . kn, then F(x) = 11 (x k,)(x ko) . • . 

(x — k„), where H is the coefficient of x" in the polynomial F(x). For 
example, in F{x) = -2 - x 4 - 2x'^ 4 - x^, // = coefficient of x^' = i. 
Since F(x) = o has roots —2, —i and i. it follows that F{x) = 
(x 4- 2 )(x 4- i)(x — i) for all values of x. 

Proof. Since = o hy supposition, F{x) has a factor (x — *,), 

say F{x) = (x — /<,) Q{x). But also F{k^) ■- (k^ -- k^) Q^k^) - o, and 

since and are unequal {h., '/<,) ^ o and so Q{k.,) == o. 'I'herefore 
Q{x) has a factor (x — h-Dj i-^* ^ 

(x - kf) Q'(x). I’utting X h„ since F(/e.,) o, we see that Q‘{k^) - o. 
80 that Q'{x) has a factor (x - k,). Proceeding in this way we finally 

arrive at the expression /'(x) (x - /<i)(x kf)...{x /<„) Q (.x). 
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Since the degree of the quotient goes down by i at each division, 
and the degree of F{x) is tz, the final quotient Q'” • * '(:r) must be simply 
a constant. And since the coefficient H of the highest power a:" of a; 
in F{x) must be equal to the coefficient of the highest power in 

which in turn must be the coefficient of the highest power 
in finally find Q'" • • '(^) = which proves the 

theorem. 


Theorem 3.3. A polynomial F(x) of the nth degree cannot have more 
than n roots^ i.e. F(x) cannot be zero for more than n different values of x. 

For suppose that the polynomial had (n + i) roots, ^j, ^2 . . . 

Then by the previous theorem F(a;) = H (.%* — — kf) . . • (ar — ^n), 

and F {kn+T^ t> = H (A„+i — (^n + i ^2) • • • ^n)» which 

is impossible, since none of the factors are zero. [If H = o this would 
mean that F(.v) contained no term i/.v", and so was not of the nth 
degree, contrary' to our supposition.] 


Theorem 3.4. If two poly?iomials of degree not exceeding n are equal 
for more than n values of x, then they are equal for all x and have identical 
algebraic expressions, i.e. they are the same polynomial. 


This means that if, for example, we have an equation A -h Bx + 
Cx- + D.x^ = A' -r B'x 4- C'.v- -r D'x^ which is true for more than 
3 values of .v, then A - A', B = B', C — C‘, D = B'. For by trans¬ 
ferring the terms on the right-hand side to the left, we obtain {A — A') 4- 
{B - B’)x 4 (C — C').V“ + {D — D').x^ = o, and the previous theorem 
shows that this cannot be true for more than 3 values of a; unless 
A A' - o, B - /i' = o, C — C' = o, Z) — O' = o. 


We have already shown that a set of data can be represented 
apprdxhnatcly bv two quite different polynomials. But the last theorem 
sltuws iltut although the two graphs may be approximately the same, 
thev carj only exactly coincide at a finite number of points, and not 
««ver anv continuous range of values of .v, however short. In other 
words, every polynomial has its own individual and distinctive graph, 
which is difierent trom the graph of any other polynomial (although 
for a certain range of values of .v the difference may be only small). 


irUTHER EXAMPLE 


(2) Faciori/e F{.\,\\ z) -- X'z — .x'-y y-x — y-z -f- z-y — z^x. 

(i) Firstly, let us suj^pose that y and c are unequal: keeping the 
\alue <it r and ~ fixed, the given expression is a second-degree poly¬ 
nomial in A*. It is Zero when a- y or a* — xr (by direct substitution), 
and since y v- ~ by suppositioig we can apply 'Theorem 3.1 and write 
F - 11 (a- .v)bv x;). where II is the coefficient of .v-, i.e. (xr — y). 

Therefore F{x, y, x;) - (x: —y)(A- - y)(.v — x;) ^ (.v —y)(y — -)(;; — .v). 

(ii) Secondly, let us suppose that y — xr; then bv direct substitution 
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F = o. Therefore the equation F(jf, s) = (a: — y)(y — 

still holds good, i.e. it is true for all values of x, y\ and z, and is the 

required factorization. 

It might be supposed at first glance that we could simplify the above 
factorization by saying that since F{x,y, z) is zero whenjy = z, it must 
contain {y — .a) as a factor as well as {x — y) and {z — :v). But we 
have not proved that if it is divisible by (y — z), (jc — y) and {z .v) 
separately it is necessarily divisible by the product {x — y){y—^){z ^)- 

In fact this happens to be true. There is a more general theorem that 
if we call a polynomial “prime” if it has no factor of lower degree, 
then any polynomial can be split up into prime factors in one and 
effectively only one way. But the exact statement and proof of this 
theorem are rather beyond the scope of this book. 



CHAPTER FOUR 


\ 


COMPARISONS OF MAGNITUDES 


4.1 Inequalities 

In elementary algebra we spend much time considering equations 
like I -f- 4^: + 3^*® = o, x y ~ 2, and discussing how to solve them. 
But in practice it is important not only to be able to say when two 
quantities are equal, but also to compare them and say which is greater 
and which is less. For example, we may find it convenient to use an 
approximate formula: if we know that the error in this formula is never 
greater than -oooi we may be perfectly satisfied with the formula, 
without asking exactly what the error is. Comparisons of magnitude 
such as this are known as inequalities. 

We have already come across a number of such inequalities, as 
when we say that the function i has a minimum value i, and can 

never be less than i: the reason for that is that x- is never negative. 
We must now bring in strict definitions and rules for dealing with 
inequalities, so that wc can state them with precision and manipulate 
them with ease. 


I he first thing to notice is that we have actually been using the 
wortls ‘ greater” and “less” in two different senses—although that 
should not have caused any confusion, as so far the context should have 
made clear which sense w'as meaiu. *^I’he ambiguity in question occurs 
in comparing negative numbers: which is greater, —4000 or —4? In 
oiie sense ol the word “greater”, —4000 is obviously a much bigger 
or greater number than —4. 

An electrified wire at a potential of —4000 volts is very dangerous 
to touch; one at a potential of —4 volts is completely harmless. A 
pressure- of - 4000 kilograms weight, i.e. a tension of 4000 kilograms, 
w iil tear most common objects apart, whereas a tension of 4 kilograms 
IS comparatively harmless. On the other hand the sea bed at a height 
of —4000 feet is clearly lower than at a height of —4 feet; a man 
m Hank, i.c. an ovenlraft of is worse off 

than one with ;C( —4)’ temperature of —200® is colder than 


Now as has been said we can to some extent make this distinction 
clear in jndinary lan^age. If we say that a number is “large and 
negative then we think of something like —4000 rather than —4. 

but the best thing to do is to invent a notation which expresses our 
ideas wiUMut this ambiguity. 


bz 
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4.2 Symbols for comparisons 

The sense we shall first deal with is the one which is natural in 
comparing heights, or bank balances, in which 4000) is considered 
as smaller than ;£(—4). We shall say that is greater than a'\ or in 
symbols A > a, if the difference {A — o) is positive. (Notice that the 
wide end of the symbol “>” is next to the greater number, and the 
narrow end next to the smaller, so that the symbol is a picture of the 
relationship.) Thus 4000 > 4 > —4 ^ —4000; for all x, —4 

(because x^ + 4 ) is always positive); and :r > o is another way of 
writing “a: is positive”. In the same way we shall say that '‘a is less 
than A’\ or a < A, if {a — A) is negative. 

The principal rules of operation with these symbols are as follows; 


(A) The statements A > a and a< A are equivalent, i.e. have the 
same meaning. For if {A — o) is positive, then (a A) is negative, 
and conversely. 

(B) If A > a, then whatever .v may be, A + x r ■ a + .v. In words, 
“we may add or subtract the same quantity froin both sides of an 
inequality” (subtraction of x being equivalent to addition of .v). The 
proof follows from the identity {A + x) — {a + x) — A a\ we are 
supposing that A a, i.e. A — <2 is positive, and so {A -f- .v) (a + x) 
is positive. 

(C) If A > a and B > b then A + /i > fl - b, i.e. “we may add 

together the left-hand sides (L.II.S.) of two inequalities to give a new 
L.H.S., and the two R.II.S’s to give a new R.II.S., provided that the 
two inequality signs point in the same direction . 1 his follows from 

the identity {A B) - {a -f b) - {A - a) {B - b) > o. [Query: 
Is {A -- B) necessarily greater than {a — h)> If so, why? If not, 

why not ?] 

(D) \f A > a and k > o then kA ha. “We can multiply or^ 
divide both sides of an inequality by the same positive number /f.” 
(Division is included since division by k is equivalent to multiplication 
by ifh.) For hA ha k (A - a), and b and (A — a) are both 
positive. If, however, A is negative then the inequality is reversed: 
A > a implies AA - Aa (for example, —A < — a). 


(K) If A and o arc both positive and A ■ a, then A- > and 
y/A \/a, but \ A ■ i/w. “'I’he greater a positive number is, the 

greater is its square, the greater its square root, and the smaller its 
reciprocal.” 'I'he proofs follow from the identities {A- - a^) -- 

{A - a)(A •[ o),(VyJ Vu) (A - u)/(x/A f V"). and ijA - ija 


— {a - A): a A. 


(F) \f A B and B C then A ,' C. 

It is also convenient to have a composite symbol "A > a” or 
.. a” for "A is greater than or equal to a", and similarly "A < a" 
or "A a” means "A is less than or equal to a". 'J'he symbol 
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**A > a*\ which is sometimes used, means is not greater than a**y 
and is exactly equivalent to **A < a”. 

Theorems concerning inequalities can vary from very simple and 
obvious results to very difficult and subtle ones. There are three main 
lines of attack on the simpler cases. We can use the definition that 
A > a if {A — a) is positive; we can combine known inequalities 
using the above rules (A) to (F); and we can rely on the fact that a 
square is always positive or zero; e.g. since (.v — ly — — zx H- i > o 

we see that + i > -zx for all values of x. 


PROBLEMS 

(1) Show that the rules (A) to (F) remain true if “>” is replaced 

throughout by and by 

(2) What happens to rule (C) if A "> a and 5 > 5 ? 

(3) If A > a, then A^ > and A^ + A^ A > -i- a. 

But why does A~ > a- not necessarily hold? 

(4) For all values of x, show that x^ -f- 4^ > — 4. When does the 
equality hold ? 

(5) For all positive x, x + i/.v > 2. What happens for negative a*? 

(6) What happens to rule (E) if A and a are not necessarily positive ? 

(7) If A, a, B, 6, are all positive, and A > a, B > b, then AB > ab. 

(8) For all values of .v, JV, y, Y show that (a" + + Y^) > 

(.r.Y +jyT)-. When does equality occur ? 

(9) If .V and y are positive, and .v > what relation holds between 

i/\ .V and I'i \ y? What relation holds between i) and i/(y^ + 1 ) ? 

And between (.v- -h 4 - 2) and (y~ 4- i)/(.v^ 4 - 2 )? 

( 10 ) It « 4 b ■ c ^ o, then -j- 6^ H- 


4.3 Comparisons of absolute magnitudes 

1 he “modulus’* or “absolute value” of a number x is simply the 
mimbtT .V with its sign made positive: for example, the modulus’^ of 
4 is '> 4, v liile the modulus of -f-4 is also -*-4. 'I'he modulus of x 
is denoted by the symbol j.v[ (spoken as “mod -v”). 'Ehus, speaking 
precise!}-, we can define l.v] as follows: 


If .Y is positive or zero, |.v| = .v, 
if .V is negative, |.vj = -- .v. 


! his notation enables us to express in unambiguous fashion the 
second meaning ot the phrase “-4 is greater than in untechnical 

•"The reader must be warned that mathematicians use the word '‘modulus” in 
several quite diflcrcnt senses. "modulus of elasticity”, for example, measures the 
relauonshsp between stress and strain. Ikit the notation [.v| is never used except for 
llic absolviic value ot .v, citlier in il^e sense used above or some closclv related sense. 
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language. That is, that A is greater in magnitude than a irrespective 
of their positive or negative signs: or simply that A has a greater 
modulus (or absolute value) than a. In symbols \A\ > [<j|. Thus a 
potential of —4000 volts is greater in its absolute value than one of 
+4 volts, and is more dangerous to approach. This is shown by the 
relation ]—4000] = 4000 > I4I. On the other hand —4000 volts is 
still the lower potential, in the sense that an electric current will flow 
from the higher potential of 4 volts to the lower potential of —4000 
volts: and that is shown by the inequality —4000 < 4 without the 
modulus sign. 

In order to manipulate these relations it is necessary to know the 
properties of the function 

The chief properties are: 

For all values of a: and _v, |.yy| = |a.'| |y[ . . (4.1) 

and provided thatjy ^ o, |.v/y| = |.v[/|j| . . (4*2) 

These properties follow from the usual “rule of signs” for multi¬ 
plication and division. In multiplying ( — 232) ( - 47) we first multiply 
232 X 47 as positive numbers, and then see that the correct sign is -j-, 
since the two factors are negative. 'I’hat is, |(— 232) (—47)] = 232.47 

= I -232I . I - 47 |. 

For all |.v| = > -v . . . (4.3) 

This follows immediately from the definition of l.v|. 

For all .V and +>^1 < l-^l T [>’l • • ( 4 - 4 ) 


This is almost obvious: if .v and y were forces acting (in the same or 
opposite directions) on a body, then (.v H y) would be the resultant 
force. This would mean that “the magnitude of the resultant force 
is not greater than the sum of the magnitudes of the component 
forces”. A formal proof is as follows: 


By (4-3). (4-*). 
positive, 


2xy " \2.xy\ = 2|.vl \y\. Also, since and are 
- 1 \y\- l>y (4.1). 


On combining these two relations, by rule (H), 

a:- 4 y^ -t- 2xy < + lyl'" I 2 \x\ ly\> 

{x -\-yr ^ {\x\ 4 |y|)^ 


i.e. 


'I'aking the positive square root of each side, and using rule (K), 
we find that 

Iat 4 y| l-vl 4- lyl- 


P 
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PROBLEMS 


(1) 

( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 

( 7 ) 

( 8 ) 


Draw the graphs , , , 

y == |;c[, y=\5-\-x\+\s-xly=\x-^2\- !:v|. 

Show that 1 ^”[ = 

If 1^1 > |«1, and |Bi > \b\, show that > \ah\. 

If 1^1 > \a\ is it always true that 1^ + x| > |a + ? If so, 

why? If not, why not? 

Prove that |i/:v| = i/l^l* 

If 1^1 > 1^1 is it always true that \ ilA\ < \ila\ (provided that 

a ^ o)} If so, why? If not, why not? 


When does (a: + jy| = 1^| + 1^1? 

Prove that +jy + 5 :| < + |>'| + \^\ 

[Hint; apply ( 4 . 4 ) twice.' 


We can also use this notation to prove formally many of the relations 
we have already stated in words. For example, we have already said 
that we can make ijx as large as we like in magnitude by making jc 
small enough—suppose we were challenged to make it larger than 
1,000,000. We could reply: 

“You want 1000000 < |i/.vl = i/|.v|, do you? Well, lets multiply 
both sides of this inequality by \x\, obtaining 1000000 \x\ < i, and 
then divide by 1000000, obtaining \x\ < -oooooi. For example, 
AT = -0000005 would satisfy you, wouldn’t it?” 

Put here our challenger replies that he would really like i/x to be 
greater in magnitude than 1,000,000,000,000,000. 

Retort: It’s difficult to see why you should, but it’s only necessary 
to take \x\ to be less than -ooooooooooooooi. 

We have also said that an expression like .v- -j- 2.v — 3 can be taken 
to be .v= without appreciable percentage error, provided that \x\ is large 

enough. , , , 

Challenger: Well, suppose you say that \x\ > 1000; what per- 

c<-ntage error are you likely to commit? 

Retort: 'The actual error in taking .v- 4 - 2.v - 4 - 3 as .v- is, in magni¬ 
tude, [ 2 .V -r 3I; the percent error is therefore in magnitude 



100 | 2 .V 




(taking the error as a percentage of the assumed value .v-. If we take 
the error as a percentage of the true value (.v- + 2.v -h 3) we do not 
sirnificauth' altc-r the results, but the calculation is rather more com¬ 
plicated.) Now we can rewrite E as 



100 |2.v 4 - 3 | 

-vl 1-v ■■ ■ 


100 

X 
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Since \x\ > 1000, i/jjcj < -ooi and 


2 + ^ 

]x 


2 \ + 


Therefore 


[by (4-4)] 


= 2 + 3/|a:| [by (4.2), and 

< 2*003 [rule (E)] 

E < 100 X OOI X 2 003 % 

< *2003 % 


2 ] = 2 , 13 


i.e. the error in taking x^ + 2X -h $ to be equal to cannot exceed 
•2003% (or practically 1 part in 5®®) '^hen |a.’| > 1000. 


4.4 Bounded quantities 

We have seen that by making small enough in modulus we can 
make i/x as large as we like. Similarly by making x large enough, 
we can make x^ as large as we like. The quantity 4- i) does not, 

however, behave in this way: whatever a: may be, ij{x' + i) always 
lies between o and i. For since x^ is always positive, il{x~ + i) is 
always positive, i.e. > o. And + i) > i, so that by rule (E) 
if{x^ + i) < 1. A quantity such as il{x^ + i) is said to be “bounded” 
(i.e. confined within bounds. This corresponds exactly to the word 
“limited” in ordinary language, but unfortunately “limit” has a special 
technical sense in mathematics, which we shall encounter later). More 
precisely, we say that a fixed or variable quantity ^ is bounded by a 
positive number 13 ” if |^| is never greater than 13 , i.e. |^| < 13 . 1 hus 

-j- i) is bounded by i: we leave it to the reader to show that 
xf(x^ 4- I) is bounded by 4 . On the other hand, if x can vary without 
restriction, then x^ is not bounded. But if we restrict x to a certain 
range, such as \x\ < 100, then .r^ is bounded (here by 10,000). Natural 
quantities directly measured are necessarily bounded, so that if a 
mathematical formula for one indicates an unbounded answer there 
must be a serious discrepancy between the theoretical mathematical 
model and the actual phenomena near the point at which the formula 
gives absurdly large results. The formula may, however, fit well over 
the rest of the range. 

The rules for dealing with bounded quantities are very simple. 

Theorem 4.1. Any constant quantity is bounded (by its own modulus). 
This follows at once from the definition. 

Theorem 4.2. The sum, difference, and product of two bounded 
quantities are all bounded. For let ^2 be the two bounded quantities, 
^il < I 3 „ < 13 .i, l-^al = 1^2! /?2- Then \f 3 , + < j^,| -| 

4- similarly |^, — ^z! < + ^2. and < 13 , 

Theorem 4.3. The sum or product of any number of bounded quantities 
is bounded. 

This is just a repeated application of 'rheorem 4.2. 
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SHAPES AND NUMBERS 


5.1 The use of algebra to solve geometric problems 

In Chapter 3 we saw how an algebraic relation, such asy = 2 a; 4 - 3 » 
or y = X‘ H~ can be represented pictorially by a plane curve or 
graph. But this procedure can equally well be reversed: if we have 
a suitable curve, we may be able to express it as an algebraic equation, 
and so reduce the study of geometry, i.e. relations between shapes, to 
algebra, i.e. relations between numbers. 

This idea of using algebraical methods systematically to solve 
geometric problems was first conceived by Descartes (1596—1650), who 
called it “analytical geometry”. To a large extent the usual fumbling 
and searching for correct construction lines can thus be replaced by 
straightforward algebraic manipulation. By consistent use of this 
method we can often avoid fallacies arising from a badly drawn figure. 
For example, take a triangle ABC (Fig. 5.1) in which AB # AC, and 


A 



Fig. s.i—A proof that any triangle .4BC has equal sides AB and AC 

let the pcrpciulicular bisector of BC meet BC at D and the bisector 
of the angle A at O. Join OB, OC\ and draw the perpendiculars 
OB, OF troin O onto the sides --IC, AB respectively, ’rhen since 
BD DC, and the /_BDO ^ ^CDO, tlte triangles ODB, ODC are 
c<»ngruent. so that OB - • OC. .Also the triangles OAF, OAE are 
Congruent, since they have two angles the same and the side OA in 
common, so that AF AE and OF-- OE. Thus the triangles 
BOF, COE have two sides and a right-angle the same, and therefore 
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are congruent, and so BF = CE. If follows that AB^ BF^FA 
CE + EA = AC, contrary to the supposition that AB ^ AC. 1 he 
fallacy?—it lies simply in the plausible but inaccurate drawing of 
Fig 5 I • the points do not lie in the correct relative positions. If the 
reader will take the trouble to re-draw the figure accurately, he will 

find that there is no cause for alarm. , , 1 r u 

Just as ordinary Euclidean geometry can be developed from the 

beginning by using certain “axioms” about points, lines, angles, etc., 
so in the same way we can make analytic geometry stand entirely on 
its own feet by using suitable definitions of points, lines, etc., in terms 
of numbers. But although aesthetically it may be rather more satis¬ 
fying to be completely consistent in this way, as well as helping to 
avoid possible fallacies (as we have explained above), nevertheless it 
hardly falls within the proper scope of this book. \Ve shall content 
ourselves here with using whatever method may be the simplest and 
most suitable for each problem. We shall also assume that the reader 
is familiar with the simplest geometric propositions—that the sum ot 
the three angles of a triangle is 180^; that two triangles are similar . 
or of the same shape, if the three angles of the one arc equal to the 
three angles of the other; and that in such pairs of similar triangles 
corresponding sides are of proportional length, i.e. one triangle is simply 
a magnification or reduction of the other m a fixed ratio. \\ e shall also 
mainly confine our attention to the geometry of the plane for the 

present. 


5.2 Co-ordinates of a point 

Traditionally, as we know, the proper way to explain where to 
find buried treasure is to say “starting from the blasted tree on the 
heath, walk 40 steps to the east. 30 steps to the north, and then dig 

We can adopt the same method of specifying ^ 

in a plane. We draw two lines at right angles, X OX 1 OV, meeting 
at O. (Usually X'OX is drawn horizontally, or from west to east, and 
Y'OY vertically upwards, or from south to north, hut that is merely 
a convenient convention. O is called the ori}>in, t^e x-axis, and 

Y'OY the y-axis.) If we draw the perpendicular Pu from r onto 
the x-axis (Fig. 5.2), then if the distance OU (say) - x, and UP = y, 
measured in suitable units, we can reach P froni O by going a distance 
X “east” and a distance^ “north”, x and y are the co^orJ,nates 

of P with respect to the reference axes .Y OX and y x being the 
x-co-ordinate (or abscissa, from the Latin cut off ) and y the y- 
co-ordinate (or ordinate, from the Latin “set up ): and we write for 
short P = (x y). If the point lies on the left of the ;y-axis, or to the 
“west” then’ its x-co-ordinate is naturally cr)unted as negative. If it 
lies below (“south” of) the x-axis. then its y-co-ordinate is negative. 
Thus in Fig. 5.2, Q has a negative x-co-ordinate. but positive y. while 
R has both co-ordinates negative. In this way each point of the plane 
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can be specified by two co-ordinates; this is exactly the system used 
in plotting graphs. In using the system for geometric purposes, how¬ 
ever, we always suppose that x and y are measured in the same units, 
whereas in drawing a graph it may be helpful to use different scales on 
the two axes. (The name cartesian for this system of co-ordinates is, 
of course, in honour of Descartes, its inventor.) 



Another perfectly good way of specifying the position of the treasure 
is to say how far to go in a direct line, and in what direction to go— 
e.g. we could say “walk 50 steps at an angle of 36 9® north of east”. 
In general we can use the polar co-ordinates of P, namely r = the distance 
OP, 0 = the angle /_XOP, and we shall write P = {r, (using brace 
brackets), r is here always taken to be positive. 


5.3 Measurement of angles 

An angle is essentially measured as an amount of turning: the 
angle / XOP is the turn needed to bring the line OX into coincidence 
with OP. 'I'he natural unit of angle is the complete turn: this is cus¬ 
tomarily divided into 360 degrees (^). The ancient sexagesimal system 
disides each degree into 60 minutes ('), and each minute into 60 
seconds (") so that we may speak of an angle of 19^ 36' 36", for example. 
Hut nowadays wc more often simply divide the degree decimally, 
writing the angle 19" 36' 36" as 19-61^. The conversion is quite simple! 
since 6' -- -i®, and 36" = •01'^. 

Ill certain parts of Durope another sj'stem is used: the complete 
turn is divided into 400 “grades”, which are divided decimally. It is 
difficult to see nuich advantage in this, while it makes such familiar 
angles as 30 ' and 60^ difficult to write: but it is well to be aware of its 
existence. 
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In a plane, rotation may be either clockwise or anticlockwise. We 
usually take the anticlockwise direction or “sense” of rotation as the 
positive one—that is, the sense of rotation from the positive .v-axis OX 
to the positive ^-axis OY. Thus in Fig. 5.2 the angle 0 or “polar 
angle” of the point Q may be taken as i50^» while that for R can be 
taken as 225°, or else as —135^> according to whether we turn anti¬ 
clockwise (positively) or clockwise (negatively) from OX. In future 
therefore every angle will be understood to have its own proper sign 
associated with it, the angle /_ABC (for example) meaning the turn 

from BA to CA. ^ 

From a purely geometric point of view an angle of 360®, or a 
complete turn, is equivalent to an angle of o®, or no turn at all: the final 
result is the same. In the same way angles of 270®, 630®, and 9® 
are all equivalent: they differ only by one or more conaplete turns. 
But they may be very different from other points of view: place a 
person on a rotating stool, and his final feelings will be very different 
according to whether he is turned through 270®, 630®, or go , although 
in all cases he will end up facing the same way. 

5.4 Relation between polars and cartesians 

If the angle 6 is kept fixed but the point P is allowed to move, 
then although the values of x,y, and r will alter, their ratios will remain 
fixed. Thus if P' = {x', y') = {r\ 0 } (Fig. 5.3), then defining U' as 



{x\ o) the triangles OVP and OV'P' are similar (having two angles, 
0 and 90®, the same). Accordingly xjy = x'ly', x/r = x'Jr', and so on. 
This shows that these ratios depend only on the angle 0 and on nothing 
else. There are six such ratios. We call x/r the cosine of the angle 0 , 
or cos 0 ; yjr is the nne of or sin 0 (pronounced “sine”, not “sin”); 
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y/x is the tangent, tan 6 , xjy the cotangent, cot d, rjx the secant, sec 6 , 
and rfy the cosecant, cosec Q or esc Q. These six ratios are known as 
the trigonometric ratios or circular functions of the angle 6 , and their 
values for values of 6 ranging from o'" to 90® will be found in most 
books of mathematical tables. We shall shortly see how they may be 
calculated. 

Note that since x and y are cartesian co-ordinates, they may be 
either positive or negative, according to the position of P. As drawn 
in Fig. 5.3, both .v and y happen to be positive, but if (for example) P 
was to the left of O (when 0 lies between 90® and 270®) x would be 
negative, and if P lay below O, y would be negative. But the radius r 
is by definition always taken to be positive. 

Although the trigonometric functions are defined as ratios, their 
chief use is to find the other two sides of a right-angled triangle when 
one side is known, together with the angle 6 . For from their definitions 
we have at once: 


Given .v = OU, then 

y ^ UP = X tan 6 , r = OP = a- sec 6 
Given y = UP, then 

A = Oil ~ y cot 6 , 

Given r OP, then 

.V = OU r cos d, y -- UP = r sin 6 


r = OP = y cosec 6 


( 5-0 

( 5 - 2 ) 


( 5 - 3 ) 


\\ e can also find the angles Nvhen two of the sides are known. If, 
for example,^ ^vc know ,y and y, then ^’/a = tan 0 , is also known, 
and we cari find iroin a tabic of tangents what angle 0 gives this value 
of tan 0 . The other angle /_^OPU must be 90® — 0 , since the three 
angles ol the triangle must sum to 180®. Also, kno^ving a and 0 we 
can find r from (5.1). 

h.quations (5.1), (5.2) and (5.3) accordingly give the relations 
helween the cartesian co-ordinat -s (x, y) and the polar co-ordinates 
1^. 0 }. I hey may, h<jwevcr, be used lor manv other purposes. As 
we have seen, they give the complete solution of problems involving 
right-angled triangles. Dtlier figures can usuallv be cut into right- 
angled triangles and thereby solved (whence the word “trigonometric”, 
from the Greek lor the “measurement of triangles”). 


EXAMPLES 

I I t " *** building slopes upwards at an angle of 30®. 

I l.v iughest pnint r is 15 metres alv^ve the lowest point O. What 
IS rli« iengtii OP (T the roof? 

Gompleie the triangle OI P, as in Fig. 5.4. Then OP VP 

cmscc 30 : In .in ia(>les, cosec 30 2, and GP is given to be 1-5 metres. 

1 r.eretore OP metres. 
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We can see that cosec 30® = 2 from the following construction. 
Draw OP'y of the same length as OP, downwards from O at an angle 
of —30'". Then the triangle POP’ is isosceles (with two equal sides 



Fig. 5.4—Problem of the length of a roof 


OP = OP') and OU bisects PP' at U. But since the /_P'OP = 60®, 
the triangle POP' is in fact equilateral (all three sides equal) and 
OP = P'P = 2 VP. Thus sin 30® = UPjOP -= and cosec 30® = 
OPjUP = 2. 

(2) A man wishes to find the height of a flagpole. He walks 4-5 
metres away from it, and then finds that the tip P of the pole appears 
to be 45® above the horizon. If the man’s eyes are 15 metres above 
the ground, what is the height of the pole ? 



Fig. 5.5_Problem concerning the height of a flagpole 

I^t O be the man’s eye, O' his foot. U' the foot of the pole, and 
V the point on the pole at the same height as O. 1 hen LI 0=00 = 
x-5 m, and VP = OV tan 45' = 4 5 m, since tan 45’ = 1 (Fig. 5.5). 
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Thus the total height of the pole is U'U + UP = 6 metres. We can 
see that tan 45® = i from Fig. 5.5: for since the three angles of the 
triangle OPU sum to 180°, we must have /_OPU =45®= Z.UOP 
whence OU = UP, tan 45® = UPjOU — i. 

(3) A surveyor wishes to find the height of a hill. He finds that 
from a point Al in a level plain not far from the hill the peak P appears 
to be 10® above the horizontal, while from a point N 1 kilometre further 
away from the hill the elevation is only 5® (Fig. 5.6). What is the height 
y of the hill ? 


P 



Fig. 5.6—A method of finding the height of a hill 


Solution: referring to Fig. 5.6, AfU =■ y cot 10®, "NU = y cot 5®, 
and therefore NM — NU — AlU ^ y (cot 5® — cot 10®) = 5'859 y 
(from tables). But NAl = i km = 1000 m, so that y = 1000/5-859 = 
170-6 metres. 

(4) In a triangle ABC, to find a relation between the angles and the 
opposite sides. Draw AD perpendicular to BC (Fig. 5.7). We shall 


A 



Fig- 5-7—The relation between the angles of a triangle and the 

opposite sides 

call the angle Z.CBA, " for short. Then AD AB sin AB = 

AC sin /_C', or dividing through by sin /_B sin AC 

AB! ?\w AC ~ AC; sin AB . , . (5.4) 

— BC'i sin AA b^- a similar argument. 

If we are given any side and opposite angle of a triangle, and also 
any one other measurement (of a side or angle), we can find all the 
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remaining dimensions of the triangle using this formula. For example, 
suppose AB = 2, AC = 3, Z.C = 30°. Thus 2/sin 30'' = 3/sin /iB = 
BC/sin /_A or, since sin 30® = ^, 

sin /_B — 3/4 and BC — 4 sin /_A. 

From tables, /_B = 48-6® or i3i'4°, Z_A = 180° — /_B — /_C = 
101-4® or i8-6®, and BC = 3 93 or 128 respectively. (Thus there are 
two possible triangles satisfying the given conditions, one with 
Z.B “ 48-6®, /_A = 101-4®, = 3 ' 93 » other \vith AB = 

I 3 I' 4 ^ /.A = i8-6®, BC = 1-28.) 

5.5 Relations between trigonometric functions 

There are a number of important relations between the circular 
functions. Firstly we know that x = r cos y = r sin 6, so that 

tan 6 — yjx = sin 0/cos 6 \ 

cot 0 — xjy = cos 0/sin 0 I /. 

cosec 0 = rjy = i/sin 8 1 

sec 0 = rjx = i/cos 0 I 

so that all the six functions can be expressed in terms of sin 0 and cos 6 
alone. Note: sin 0 /cos 0 means (sin 0)/(cos 0 ), not sin ( 0 /cos 0 ). 

Now in the triangle OUP (where OU = x, VP = y, OP = r and 
AVOP= 0 ) draw the perpendicular UZ from U onto OP. Then 
/_OPU = 90® — 0 , and therefore ZPVZ = 0 . Thus (Fig. 5.8) 



OZ = X cos 0 = r cos 0 . cos 0 

= r (cos Sy 

ZP = y sin 0 - r sin 0 . sin 0 

= r (sin Oy 

But OPr== OZ ZP, i.c. r=r (cos 0y + r (sin Oy. 
If we divide this equation tlirough by r wc have 

(cos oy ^ (sin 0)2 = I 


( 5 - 6 ) 
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If on the other hand we multiply through by r, we obtain 

^2 _ ^ ^2 y 2 ^ ^ (5*7) 

which is Pythagoras’s theorem, giving another relation between a:, 
yy and r. 

It is customary to write (cos 0 )^ as cos* 0 , and (sin 0 )^ as sin®^.* 
Whatever the notation, equation (5.6) shows that there is a relation 
between cos 0 and sin $ which can alternatively be written 

cos 0 — rt I — (sin 0 )^ 

sin 0 = ± VI — (cos 0 y 

so that when the value of sin 0 is known, that of cos 0 is fixed apart 
from its sign, and vice versa. 

We can see what this means by considering the way in which cos 0 
and sin 0 vary as 0 changes, as shown by their graphs (Fig. 5.9). If 



r I then cos 0 = .r and sin 0 = y, i.e. (cos 0, sin 9) are the co¬ 
ordinates of the point on the circle r — i in direction 9 (Fig. 5.10). 
As 0 increases from to 90®, .v ~ cos 0 decreases from i to o, and 
y = sin 0 increases from o to i. When 0 increases from 90*^ to i8o^ 
cos 0 l>ccon.cs negative, decreiising from o to -i, while sin 0 remains 
positive, decreasing from i to o. For i8o'"< i^<270^ we have both sine 
and cosine negative, while for 27o^<0<36o^ only the sine is negative. 

cos J'h verr'l'i'iTnotation; strictly spe-akinR cos >0 ought to mean cos 

in Section 7 -^ In I t consistent to use the notation explained 

, t n 3 ./ and to write (cos $)- and isin 0 )^ as cos ^1* and sin «)» 1 ooicallv this 

7f ^ “V'' ''7 only Leri 

be tfres^m^ ^nibiguin': inserted evco'Avhere it would 
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W^e have now reached the initial position, and the whole sequence 
repeats. Thus any combination of signs of sin 0 and cos 6 is possible: 
in fact it is not difficult to see that 


cos (i8o® — 6) = — cos ^ 
sin (i8o^ _ 0) = sin 0 


cos (i8o® + 5 ) = — cos 0 
sin (i8o° H- 6) = — sin 0 

cos (360° — fl) =* cos (— 0 ) 
sin (360^^ — ^) = (— 0 ) — 


cos 0 
sin 0 . 


( 5 - 8 ) 



These relations enable us to find the sine and cosine of any angle, 
using only tables of angles from o® to 90®. If, for example, we^want 
sin 150“, we use the relation sin 150® = sin (180 — 150)° =:= sin 30'’= 
Other important relations connect the angles 0 and 90® — 0 . For 
in Fig. 5.10 Z_OPU 90° — 0 \ so that 

sin (9®"^ — O UjOP = cos 0 

cos (90'' — 0 ) = UPjOP = sin 0 

tan (90'’ — 0) = OUjUP = cot 0 

cot (90® — ^) = UPjOU = tan 0 

cosec (90'’ — 6 ) — OPjOU = sec 0 

sec (90^ — 0 ) = OP!UP = cosec 0 . (5.9) 

As the triangle is drawn in Fig. 5.10 0 lies between o'" and go^; 

but a little investigation will show that these relations hold for all values 
of 0 . Fortunately they are easy to express in words: “by changing 0 
into go*" — 0 , the sine is changed into the COsine, and vice versa, the 
tangent into the COtangent, and vice versa, and the secant into the 
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COsecant, and vice versa: that is, a *co-* is either added on or taken 
off”. In addition, the relation sin 0 = cos (90® — 0 ) has the practical 
use that it enables sines to be obtained from cosine tables. 

Similarly the relations cot 0 — tan (90° — 0 ) and cosec 0 = sec 
(90® — 0) dispense with the need for special tables of cotangents and 
cosecants. 


PROBLEMS 


(1) Prove that tan 0 . cosec 0 = sec 0 . 

(2) cos 0 . cosec 0 = cot 0. 

(3) (cosec 0)8 = I 4- (cot 0 y. 

(4) (sec 0 + tan 0)(sec 6 — tan 0) = i. 

(5) Using Pythagoras’s Theorem, show that |sin 0| < i, |cos 0| < i, 
for all values of 0. What follows for cosec 0 and sec 0 ? 


(6) What are the values of tan (180® + 0 ), sec ( — 0 ), sin (go® + 0 ), 
cot (180° — 0), cos (270® + 0)? 

(7) Show that if we cut a square of side i down the diagonal we 
obtain two congruent triangles of sides i, i, and \/z (= 1*414) and 
angles 45®, 45®, 90®. Find (to 3 figures) all the trigonometric functions 
of 45 °- 

(8) Show that by bisecting an equilateral triangle of side 2 we obtain 
a triangle of sides i, ^3 (= 1-732), and 2, and angles 30®, 60®, 90®. 
Deduce the values of the trigonometric functions of 30® and 60®. 

(9) Given that sin 15® = -259 and cos 15® = -966 (to 3 figures), 
calculate tan 15 , cot 15®, sec i5*^» cosec 15®. What are the trigonometric 
functions of 75® ? 

(10) Use the results of the three previous questions to draw rough 
graphs of cosec 0, sec 0, tan 0, cot 0, and (sin 0)8 for values of 0 from 
o® to 360®. 

(i i) From a boat sailing due east it is noticed that a buoy appears 

to be 15 east of north. After sailing 1 kilometre, the buoy then appears 

to be 30° west of north. How far was the boat from the buoy (a) at 

the beginning of the kilometre run, (6) at the end, (c) at its nearest 
approaclt ? 


(12) A cyclist is going up a hill at an incline of i® at a speed of 
iS kilometres per hour. Given that sin i® = -0175, cos 1® = *9998, 
at what rate (in metres per second) is his height increasing? Assuming 
that the cycle and cyclist together weigh 70 kilograms, and that to raise 
I kilogram through i metre in 1 second is to do work at the rate of 
g 9-81 watts, how many watts is he using in overcoming the pull 
of gravity, apart from the friction of the cycle and air resistance? 
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5.6 Projection and the addition formulas 

Definition: If P is any point, and PU the perpendicular from P 
onto the x-axis OX, then U is called projection ofPon OX (Fig. 5.11). 
If Q is another point, with projection W, then the segment UW is 
called the projection of PQ on OX. If PQ makes an angle with 
the direction OX, then UW = PQ cos ^pq. In this formula UW is 
to be taken with the proper sign, positive if W is to the right of U, 



Fig. 5.XX —Projection on the line OX 

and negative if to the left. In other terms, if P has cartesian co¬ 
ordinates {x', y’), and Q has co-ordinates (x", y"), then since x' = OU, 
by definition, x" = OW, then UW = OW — OU = x" — x'. 

If R is another point {x"',y"') with projection V, and QR makes 
an angle with OX, then the projection of QR is 

WV = QR cos — -sc". 

Now the projection of PR is UV = x”' — x'; and if UW, WV, 
UV are taken with their proper signs 

UW + WV ^ UV 

[or (x" — x') + (jc'" — x") = (x'” — x')]: so that if PR makes an angle 
0PH with OX, 

PQ cos '/-PQ 4 - QP cos ^QB = PR cos ^PB . . (5.10) 

This equation holds for any triangle PQR. But a particularly 
important case occurs when the angle at Q is a right-angle, and PR = 1. 
If then /_QPR = a, PQ — cos a and QR — sin a. Also in this case, 
if ^pQ = fi, then •/'qb = ^ 4 90° ^pb = a -f- Thus formula(5.10) 
becomes 

cos a . cos fi -f- sin a . cos (90® -f- ^) = cos (a -f- ^). 
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Now COS (90° -{- /S) = COS [90® — (—^)] 

= sin (—/ 9 ) [by equation (5.9)] 

— — sin [by (5.8)], whence 
COS (a + ^) = cos a . COS ^ — sin a . sin ^ . (5.1 1 ) 

This is an identity, true whatever the values of the angles a and 
may be. 

Similarly by considering projections on they-axis, OY, one obtains 
in general 

PQ sin tfsr<t + sin = PR sin ^pk 
and in the particular case in which PQR is a right-angled triangle 

sin (a + = sin a . cos ^ + cos a . sin ^ . (5.12) 

Particularly important cases of these formulas occur when a = 
we then have 

cos (2a) = (cos a)= — (sin a)^ 

sin (2a) = 2 sin a . cos a . . . (5.13) 


5.7 Calculation of trigonometric functions 

We have mentioned above the existence of tables giving the values 
of the trigonometric functions of angles. The reader may have wondered 
how these were computed in the first place. We are now in a position 
to explain one possible method of computation; this is a very simple 
method, tlKJugh not the most efficient one. We have already seen 
from equations ( 5 - 5 ) ^hat all other functions can be expressed in terms 
of the sine and cosine, while sin 6 = cos (90® — 6 ). Thus it is only 

necessary to compute a table of cos 6 in the first place; all other tables 
can be derived from this. 

The key formula is (5.13). Since (sin a)2 = i — (cos a)^ [by 
equation (5.6)], this can be written cos 2a = 2 (cos a)* — i, or (cos a)* 
- i (i + cos 2a). If we put 2a = this becomes finally 


cos = ± 


(5- 14 ) 


We row begin by taking some simple and convenient value for 8, 
say fi = 90°, cos ^ = o. This gives us cos 45° = (the -f- sign is 

taken, since cos 45^ must be positive). From cos we ffet at onre 
cos I3f ^ cos (180- ~ 45^) = - cos 45° - - i/xA. We now sub¬ 
stitute m (5.14) the values ^ = 45" and ^ = 135", obtaining cos 22-5'" = 
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once cos 157*5® = (180® — 22-5®) and cos 112*5®. Again by sub¬ 

stituting in formula (5.14) the values = 22*5®, 67*5®, 112*5® 
^57*5° we find the cosines of half these angles, i.e. of 11*25®, 33’75°» 
56*25® and 78*75® (i.e. of 1/32, 3/32, 5/32 and 7/32 of a complete turn 
of 360®). Proceeding in this way we find the cosines of more and more 
angles—the next step gives us the cosines of all angles which are 
multiples of 36o®/64, the step after that all multiples of 36o®/i28, 
halving each time. In time we shall get as near as we like to any given 
angle; and when we have enough values we can fill in the gaps in the 
table by interpolation. 

5.8 The use of tables 

The values of the sines and cosines computed as in the preceding 
section, or by any other convenient method, will usually be compactly 
presented in a table of columns of which a small part is shown in 
Table 5.1. 


Table 5.1—Table of sines 


9 

•0 

•1 

1 

•2 

■3 ! 

4 

■5 

! -6 

1 

•7 ; 

■8 

•9 

24^ 

1 

•3907 

■4067 

1 

•3923 

;4 o 83 

1 

•3939 

■4099 

•3955 

•41*5 

•3971 

-413* 

■3987' 

;'4H7 

•4003 

-4163 

[4019' 

!+*79| 

■4035 

■4195 

■4051 

•4210 


This table is to be read as sin 23*0 = * 3907 . sin 23*1® = *3923, 
sin 23*2® = *3939, etc. Now suppose we were given the angle to 2 
places of decimals, say as 23*13®, we would then use linear interpola¬ 
tion (formula 3.12). Since sin 23*1® = -3923. sin 23 2® = * 3939 . we see 
that as 0 increases by *1 , sin 0 increases by *0016. 'rhcreforc as 6 
increases by *03 , sin 0 increases by *0016 X *3 = *0005 (to 4 figures). 
We can say that “the difference in sin 0 for 03® is *0005”, and so sin 
23*13® = sin 23*1® *0005 = *3928. 

Now if we calculate sin 23*23® in a similar w'ay, we find that the 
correction for the added *03® is again *0005, the same as before. The 
reason is that the difference between sin 23*2 and 23 3® is again *0016, 
the same as between sin 23*1 and sin 23*2 . 'I'he same is true for all 
angles between 23® and 24®, so that we can give a table of differences 
applicable along the w hole row. 


Difference in angle 

1 

•01 

' i 

•02 

, 03 

-04 

1 

■05 

Difference in sine 

•0002 

•0003 

•0005 j 

•0006 ' 

*0008 
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For compactness, however, we usually express the differences in 
terms of the last figure tabulated, i.e. we call *0005 a difference of 5, 
it being understood that this is to be added to the last figure. Thus 
we find, as a rule, at the right-hand side of the table 9 “mean differ¬ 
ences”, showing the amounts which have to be added to allow for a 
second place of decimals in the angle. 


Table 5.2—Mean differences 


e 

I 

1 

2 

3 

1 

1 

4 1 

1 

5 

6 

1 

7 

8 

9 

23'' 

1 

1 2 1 

1 1 

3 

5 

6 

8 

to 

II 

1 

13 

14 

24 

1 2 

\ 

1 1 

3 : 

1 

5 

6 

8 i 

10 

I I 

13 

14 

1 


Example. 7 'o find sin 23*77 


sin 23*7'^ 


= 4019 
difference for 7 = 11 


sin 23*77 


= *4030 


One pitfall must be noted. The use of “ mean differences ” as above 
IS justified by its great simplicity and convenience, but only if no 
appreciable accuracy is being lost by using the same differences for all 
entries in the given row of the table. Fortunately it is easy to see at a 
glance when this is so: the mean differences in successive rows should 
not differ liy more than 2 it we wish to be reasonably sure of not 
commuting an error e.xceeding i in the last place. In the example 
given above the mean differences for 23'" and 24^ turn out to be identical, 
and so their use is completely justified by the above criterion. The 
accuracy can he increased, when using mean differences, by using 
differences positively or negatively up to 5 in the last place (see Section 
22.10), e.g. an angle ^23 37 would be considered as 23*40'^— '03®; 


Sill 


23*4 - ’ 397 ^* difference lor 3 to be subtracted = *000?, whence 

sm 2y37 -39^6. 

If the mean differences in successive rows differ by more than 2 
they sliouid yot he used, hut the actual correction must be calculated 
from the ^individual differences, as we have done above in evaluating 
sin 23*13 . I he printing of mean differences in a book of tables is 
not always a sufficient guarantee that it is safe to use them! 

.Another difficulty with certain tables is that differences arc negative, 
but are often printed positive, with only the most casual of warnings 
that they should be subtracted and not added. I'his occurs for example 
in many tables of cosines and reciprocals. In finding cos 22*63^ the 
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usual four-figure tables give cos 22*6® = *9232, difference for 3 = 2. 
This difference is really, however, —2, and so cos 22 63® = *9232 + 
(—0002) = *9230. 

We can also use these tables in the inverse manner: given the sine 
or cosine, to find the angle. For example, if sin 6 = -3945, we see that 
sin 23-2® = ' 3939 . difference being *3945 — ‘3939 ” ‘OO06. Now 
a mean difference of 6 in the sines corresponds to a difference ’04® 
in 6 (from the mean difference table), so that B = 23-2 + -04 = 23-24°. 


5.9 Trigonometric identities 

The addition formulas (5.11) and (5.12) give rise to a large number 
of identities. Since (5.11) and (5.12) are true for all values of a and j8, 
they will remain true if we replace ^ by — fi, thus obtaining the 
difference formulas 


cos (a — ^) = COS a . COS /S + sin a . sin ^ | 
sin (a — ) 9 ) = sin a . cos jS — cos a . sin ^ ) 


(5-15) 


The formulas for angles such as (90® - B), (180® + B) are simply 
special cases of the general sum and difference formulas. The cosine 
formulas are peculiarly perverse in having a 4 - sign in the formula 
for cos (a — ft) and a — sign in the formula for cos (a -{- ft). Since 
tan B — sin B/cos B we have, using the above identities, 


tan (a -j- ft) = 


sin a 
cos a 


cos ft -h cos a . sin ft 
cos ft — sin a . sin ft 


or, on dividing numerator and denominator by cos a . cos ft, 

tan (a + /3) = (tan a + tan j 9 )/(i — tan a . tan ft) . . (5.16) 

In the same way 

tan (a — ^) = (tan a — tan ft)l{i T tan a . tan ft) . ( 5 -* 7 ) 


PROBLEMS 

- . „ , 2 tan a 

(1) Prove that tan 2a = --r, 

' ' I — (tan a)* 

(2) cot (a + / 3 ) = (cot « . cot ft — l)/(cot a COt ft). 

(3) cos 3a = 4 (cos a)^ --3 cos a, 

(4) sin 3a = 3 sin a — 4 (sin a)^, 

(5) 2 sin a . cos ft = sin (a + ft) 4 sin (a — ft). 

(6) sin B 4 - sin (/>= 2 sin ^{0 4 - •^) ■ cos \{B — <!>). 

(7) sin (a 4- j 9 ) . sin (a - ^) = (sin a + sin ^)(sin 

(8) tan 3a — [3 tan a — (tan a)^/[i — 3 


a 


sin ft). 



84 BIOMATHEMATICS 

3.10 The distance between two points 

We now proceed to apply these formulas to the study of geometric 
cun.’^es and shapes. First of all, however, we have to work out the 
formulas for such basic ideas as the distance between two points, or 
the angle between two lines, when expressed algebraically. 

Let P — {x, y ) and Q = {x‘, y') be two points in the plane. What 
is the distance S= PQ? 



Fig. 5.12—Calculation of the distance PQ from the co-ordinates of 

P and Q 


Draw I’l '. OV perpendiculars onto O-Y (Fig. 5 . 12 ) and QH per¬ 
pendicular (>nt(j I P. Then by definition Ol" = x', OU = Xy VU = 
X — .v‘ Q//. Similarly IIP = y — _v’. 'I'herefore by Pj'thagoras 



• (S-i8) 


It the points are given in polar co-ordinates, P = (r, 0}, Q ^ 
{r'j 0'}. Draw OK perpendicular to OP. Let Z_QOP —9 — 6' = 
Then QK r sin fi, OK r' cos 4'^ KP - OP — OK = r — r' cos 
and 


. QK- -f KP- 

(;•' sin 4>)^ 4- (;• - r' cos 4>)- 
r'- (sin </>)- 4 - ^2 — 2rr' cos ^ + r'- (cos 4 *)^ 
- P — 2rr' cos 4> [by ( 5 . 6 )] 

OP^ r OQ^- 1. OP. O'Q. co^ ^OOP 


• (5-19) 
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This formula is known as the extension of Pythagoras. It gives 
the length of the third side PQ of a triangle when two sides, OP, OQ, 
and the angle UQOP = between them are known. If ^ = 90® it 
becomes Pythagoras’s theorem. The formula (5.19) can also be used 
to find an angle when the three sides of a triangle are known: it covers 
those cases of the solution of triangles not provided for by formula ( 5 - 4 )- 

EXAMPLE 

(i) If the distance from the shoulder joint (S) to the elbow (P) 
is 35 cm, and from the elbow to the wrist (IT) is 25 cm, and if when the 
elbow is fully flexed the distance from shoulder to wrist is 20 cm, 
what then is the angle l_SEW at the elbow ? 

Solution-. From (5.i9)> 

SIV^ = SE^ + ElV^ — 2 . SE . ElV cos Z.SEIV 

We have 

20* = 352 + 252 — 2.35 . 25 . cos ^SEIV 
i.e. cos ^SEIV = S2g, ^SEIV = ^4^ 


5. IX The straight line 

In Chapter 3 we suggested that the graph of the equation y = 
A Bx is a straight line. We are now in a position to give a formal 
proof. First of all, however, we require a very useful result: 


Theorem 
in the plane. 


5.1. Let P == (x, y) and Q = (x', y') be two distinct points 
Then the line PQ makes with the x-axis an angle given by 


tan 0 = (>• — y')l{x — Jc'), 


teith the proviso that if it should happen that x = x', then this is inter¬ 
preted to mean that ^ ± 90°. ['Phis is a very natural proviso, since 


tan (± 90'’) = ‘=^•1 . , i,--u 

A glance at Fig. 5-*2 is enough to establish 


the truth of this 


theorem. , », / v . 

Returning now to the graph y A ■'i P.v. let I == (x, y) be any 

point on this graph. Let Q be the point at which the graph cuts the 
>'-axis {x = o). That is to say, Q = {x, y') = (o. A). 'Phe equation 
y A Bx can therefore be w’ritten as (y - y’) - B (x* — jc'). We 

now have two possibilities to consider, h.ither x = x , in which case 

P = Q, or we can divide through by (x - x'), giving B ^ (y - y') - 
(* — x') = tan 0 , where •p is the inclination of PO to the x-axis. If 
therefore we find 0 from the equation tan ^ B, and draw the straight 
line L through Q at an angle ^ with the x-axis. tlien the equation jy = 
A-\- Bx states that P must Ue on L, i.e. the graph oi y ^ A -} B.x is 
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simply the straight line L (since P was defined as any point satisfying 
that equation). 

Actually we have jumped a step in the above argument. The 
equation tan ip = B has not one but two solutions: if one solution is 
W, the other is ^ i8o®. But this does not really matter—each 

solution simply covers one of the two halves into which Q splits the 
line Ly so that the final conclusion is unimpaired. 

By reversing the argument we can easily show that any straight 
line which is not parallel to the y-axis has a graph of the form y = 
A + Bx. The constant B = tan ^ is called the gradient or slope of 
the line, and we shall call the angle </» the inclination. 

Having found in this way the general equation y = A + Bx of a 
straight line L we can perform the followng calculations, which cor¬ 
respond to the constructions we can do with rulers, set-squares and 
protractors in ordinary practical geometry. 

(I) To find where L meets the x and y axes. On the y~axis x = o, 
whence on substitution in the equation y ^ A Bx, y = Ay that is, 
L meets the y-axis at (o, A). Similarly, provided B ^ Oy L meets the 
x-axis at {—AJBy o). For example, y = 2 + 2^ meets the y-axis at 
(o, 2) and the jc-axis at (—2/3, o). 

(II) Given the equation y = A + Bx, to draw the line L (say on 
squared paper). The simplest method is to find two points on the line, 
such as (o, A) and {—AjB, o), plot them, and join with a ruler. 


(Ill) 'ro find the line through two given points Q (a:', y') and 
P (.v", y "). het the line be y == ^4 -{- Bx; then since Q and R lie on it, 
y' r= A Bx' y y” = A Bx". We have to solve these equations for 
A and B. By subtraction we find that y' — y" — B.x' — Bx", whence 
B = (y' — y") i {x' — .r"). It follows that A = {x'y" — x"y') / {x' — x"). 
For example, the line joining (1,2) and (3, 5) isy = -I |.v. If x'=x" 
this construction fails, the line is then parallel to the y-axis, and has 
the equation x — .v'. 


(IV) To find if 3 points Q (at', y'), R (.v", y") and S (.v'", y'") lie on 
a straight line. If .v' ^ x" then the line joining Q and R has the equation 
^ A -f Bx, where A and B have the values given above. The 
condition that S should also lie on this line is y'" = A Bx'"', when 
this is written out at length it becomes 


y'" = [(-v'y" - X"y') + .V"' (y' -y"mx' - x"), 
and on multiplication by (.v' — x") becomes 

y' (.%■" - .v'") y" {x'" - x ) -i y"' (.v' - .v'') = o . 
For example (i, 2), (3, 5) and (7, n) all lie on a straight line. 


(5.20) 
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PROBLEM 

(i) Show that even when x’ = x", equation (5.20) gives the con¬ 
dition for Q, R, and S to lie on a straight line. 

(V) To find the equation of a line through a given point Q ix\ y') 
and of given direction (specified by its inclination tp or gradient 
B = tan 1^), From Fig. 5.12 or Theorem 5.1 we see that, if P is any 
point (.K, y) on the line, {y — y') = (-v — x') tan i/< R 

or y = (y' — Bx') -\- Bx = A Bx where A = y' — Bx'. 

Example. The line of gradient f through (i, 2) is jy = i + fx. 

(VI) To find the angle between two given lines L' (with equation 

y = A' -y B'x) and L" (with equation y = A" + B'’x). We know that 
V has inclination 0 ', where tan ./*' = B\ and L" has inclination i/.". 
where tan = B". The angle between L' and L", measured as the 
turn from L' to L/\ is therefore a = *//' — </''• 1 hus 

tan a = tan («/»" — </’') 

• = (tan — tan 0')/(* + 

= {B" — B')f{i + B'B") . . . (S -20 

In particular the two lines are parallel, i.e. a ^ o, if 5 = B . They 
are perpendicular, i.e. a = 90®, if tan a = co, i.e. i -j B B =0, i.e. 
B'B"= — I. 

Example. 3/ = 3 + 2:c is parallel to 3^ = 1 + 2X' and perpendicular 
to y ~ \x. 

(VII) To find the point of intersection of two lines L'(>- = A’ + B’x) 

and L" (y = A" + B'x). If 0 = (X V) is intersection, 

then both equations V = A' B'X and V — A -h BX 'iiust be 
true. By subtraction (A' — A") d (B' — B")X " o, i.e. X — (A —A ) 

(B' _ B"). By substitution in the equation y = A + B X y.'a find 

y — (A''B' _ A'B'')I{B' — B"). 'I’his method fails if B' — B", when 

the lines L' and L" are parallel. If A' ^ A" they have then no point 
of intersection [since there is then no solution of the equation {A A ) + 
{B' — B")X==6], while if = ^"the lines A'and L" are then identical, 

and any point on one necessarily lies on the other. 

Example. The lines >- = i + 2X and jy = 7 - 2.v meet at (i, 3). 

(VIII) To find if three lines L' (y - ^ A' + B'x), L" and L ' meet 
in a point (‘*are concurrent"). If so> and L and L do not coincide, 
then the point Q {X, Y) of intersection of L' and L" must lie on L '\ 
so that Y = A'" -f B"'X. On simplification this becomes 

{A' — A") B" + {A" — A'") B' d {A'" — A‘) B" = o (5.22) 

Example. The lines j = i + 2jr. - 7 “ and j = 4 - are 
concurrent. 
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PROBLEM 

(2) Show that (5.22) is still the correct condition for concurrence 
even if B' = B'\ but B' ^ B”\ What happens if S' = S" = B'" ? 

(IX) To find the distance from a point O = y') to a straight 

line L {y = A + Bx). 

Let QH be the perpendicular from Q onto L, and let P be an 
arbitrary point of L (Fig. 5.13). Then the distance QP obeys Pytha¬ 



goras’s theorem, OP- = HQ- r HP-. Thus if we imagine P to move 
up and down the line L, the distance OP will be a minimum when P 
is at //, and will then be equal to the perpendicular HQ. This minimum 
distance is called the “distance of the point Q from the line Z,”. 

To find HQ in terms of known quantities, let L meet the .v-axis 
at C ; and let QV be the perpendicular from Q onto the .r-axis, meeting 
L at K. 'Then 0 \' -= a', VQ -- - y\ by definition, and since K is on 
L, VK ^ A -\- Bx’. d'herefore KQ ^ ( v' — A — Bx'). Moreover, 
since /A 'GK -- ^ GK\' qo — .A - ///AO, so that ./A' 0 //=«A. 

It follows that HQ KQ cos /_KQll -- (y' — A — Bx') go's 0 . It 
remains onh' to express cos 1// in terms of B tan i/f — sin «/» /cos ^ 


\ I 


— I 


— (cos i/i)“/cOS i/t =- - - - 

'' ^^ V (cos 0)2 

Solving this equation, we have B- -- i (cos 0)^ _ or (cos 0)2 = 
I (P- 1 i), or cos 0 _l: i‘\ {B- -{ i). Which sign we take here 
depends on how we measure the angle 0 between the v-axis and the 
line L. There are two possible values of 0 , differing by iSo'", and one 
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of them will always He between —go° and 90^. (-/- could not be equal 
to ±90®, since then the line L would not have an equation of the form 
y = A Bx.) If we take this value of then cos ^ must be positive, 

and accordingly = + ^)» 

HQ = (y'— A — Bx')IV{B-i) • • (5-23) 

Example. The distance of the point (4, 9) from >- = i + is 
(9-i-3)/V(tV + I) = 5/^ = 4- 

Note on sign. The above expression for HQ is not necessarily 
positive, but has the same sign as (y' — A — Bx ), i.e. as KQ. That is, 
HQ given by formula (5.23) is positive if Q is above the line L, and 
negative if below. This property of the formula of giving not only 
the distance of Q from L but also showing on which side of L the 
point Q lies is often convenient; but if we want only the distance then 
we must use the formula 

\HQ\ =- \y' — A - Bx'\Iv{B~ + 0 - 

(X) To find the bisectors of the angles between the lines L' 
(y = A’ + B'x) and L". A point P is on the bisector if it is equidistant 

from the two lines, i.e. if 

y — A' — B'x , y ^ 

^ v(B"- + I) 

The + sign gives one of the bisectors, the — sign the other. 

PROBLEM 

(3) Find the bisectors of the angles between the lines y = i + 
and y = 2 — 


In this way we can repeat in algeltraic form any construction wc 
can do with ruler, set-square, and protractor. Ilowever, it is a great 
nuisance to always have to make an exception of lines parallel to the 
y-axis, which cannot be written in the form y -^A + JJx. 1 o over- 
aime that difficulty we can write the equation of the straight line in the 
general form /x + my = n. I his includes «« straight lines; if m ^ o 
then it can be written as y = «/m - /x/m, which is in the standard 
form y = A + Jix with A njm and li = - / m. If m o, then 
the equation becomes x «//, i e, is a line parallel to they-axis. (1 he 
case when both / - o and m = o must be excluded, since the equation 
becomes then simply n = o, which gives no information about the 
point (* y) and has accordingly no geometric meaning). All the 
above calculations can be repeated without difficulty in terms of the 
lx + my = n notation, instead of the notation yA + Bx. 
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FURTHER PROBLEMS 

(4) The London suburb of Croyhurst has two main streets, High 
Street running south to north, and East Street running west to east. 
These streets intersect at Ox Cross. There are four stations. The 
Underground station, Croyhurst High Street, lies 300 metres south of 
Ox Cross, and East Croyhurst Underground is 800 metres east of Ox 
Cross. The two British Railways stations are Croyhurst North, on the 
High Street 400 metres north of Ox Cross, and Croyhurst Central on 
East Street, 200 metres east of the cross. Assuming that both railway 
lines are dead straight, find the equations which represent them. (The 
obvious co-ordinate axes are East Street and High Street, with distances 
measured in kilometres.) At what point does the British Railways line 
cross the Underground? What is the angle between them? If a straight 
road from Ox Cross passes over the railway intersection, what is its 
equation? How far is it from Ox Cross to the intersection? What 
is the nearest distance each of the railways comes to Ox Cross ? 

(5) What is the angle between the lines Vx -f- m'y — 71' and 
l''x 4 - 77i"y = 71" ? What is the condition (a) that they should be parallel, 
(b) that they should cut at right-angles ? 

(6) In Fig. 5.13 P is the point (x, A -h .B.v). Write down directly 
an expression for the square of the distance PQy and find its minimum 
value as .v varies, i.e. as P moves along L. Thus verify equation (5.23). 

(7) Prove algebraically that the two bisectors of the angles between 
two straight lines L' and L" cut at right angles. 

(8) Prove that the three bisectors of the angles of a triangle meet 
in a point. 

(9) Show that the equation of a straight line in polar co-ordinates 
is of the form r — p cosec {6 — •/»). What is the meaning of “/>” (i.e. 
what quantity does it measure?) 

5.12 The division of a line 

A particularly important problem is the following one. Q — {x\y') 
and R = (.v", r") are two gi\’en points. We want to find the point 
P = (.Y, y) on the line OR such that the ratio QPjPR takes a given 
value, say (Fig. 5.14). We could use the properties of straight lines 
and distances wc have developed above, but the simplest method is 
the following: Draw PU, perpendicularly onto the a*-axis: 

then VU = (A’’ -- .v'), UIV ~ (.v" — -Y). Now if OR has inclination ip, 
VV = QP cos 0 . UIV = PR cos so that VUjUlV = QPjPR = A/^ 
by hypothesis: that is 

(.Y - .v')/(.v'' - X) = A/m. 
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Solving this equation for X we find 

X = (/xjc' + Aa:'')/(/x + A) 

and by a similar argument 

Y = (/xy' + + A) 


(5-24) 


(Joachimsthal’s formula). For example, the mid-point of QR has 
co-ordinates (i[x' + x"], l[y' + y"])- 


5.13 The parabola 

We have already named the curve represented by the equation 
y = Cx\ where C is a constant, a “parabola”. More general y the curve 
y = A Bx -b Cx^ has been called a parabola (Section 3 ' 4 )> 
there shown that this can be alternatively written + Cp 
By shifting the origin of co-ordinates to the point (a, - C^) ,.e. the 
maximum or minimum point, this becomes simply y C.v . But this 
change of origin and axes of co-ordinates is merely an algebraic con- 
venience. which does not affect the curve itself We shall therefore 
consider only the form of equation y - It is then possible to 

deduce certain properties of the curve from this equatirm 

The origin (o, o) is called the “vertex” of the parabola. If C > o 
it is the lowest, or minimum, point; if C < o it is the maximum I he 
y-axis X = o is the “axis” of the curve, and since the values of y at 
+x and -X are the same the curve is symmetrically placed around 
the axis (Fig. 5.15). T’he point F (o, 1/4C’) is called the focus of the 
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parabola, and the line y = — 1/4^ is the “directrix**. The directrix 
meets the axis at the point D (o, —1/4C) such that FO = OD. 



Now let P be any point (.v, y), and let 8 be the distance of P from 
the directrix: then 

8 = b + 1/4CI 
Also PF- = + (v — i/4C)=, so that 

52 _ pjr2 ^(y ^ i 4C)2 -(y- lUcr - -V" 

= (y - C.v=)/C 


It follows that 8 — PI\ or equi\’alently 8 “ — PF^ (since both 8 and 
/’/•’ are positive), if and only if 1’ = Cx~, and P lies on the parabola. 
Thus a jiarabola may alternatively be defined as the path of a point 
P which mtjverf in such a way that PF 8 where PF is the distance 
of P from a certain fixed point F (the “focus”) and 8 is the distance 
of P from a certain fixed line (the “directrix”). 

ine the points of intersection of the parabola 
y (\\- and an arbitrary straight line y - .4 Bx. At any point of 
intersection (A'. Y), both equations V ~ CA'- and Y =-- A + 5 A" must 
hold, and so by subtraction A r BX — CA- --- o. This is a quadratic 
equation: it may have tw(j roots (if B- 4.4C o), no roots (if 

JF r 4 . 4 C-' - ' o), or exceptionally a single root {B- -i- ^ 4 C o). Thus 

we sec that a straight line usually meets a parabola either in two points 
tyr none. I'^xceptionaily it may meet the curve in a single point. 


5.14 The circle 

A *.irclc is the path of a moving point P whose distance PC from a 
given fixed point C (the “centre”) has a constant value R (the “radius”). 
Accordingly wc can readily write down the equation of a circle from 
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the equations (5.18) and (5*^9)- If C" — (.Vq, y0) o> ^0} 

p_ {x^y) = {r, 6 ) the equation becomes in cartesians 


PC* = {x — :Co)’' + (>' - Jo)" = R- 
+ jy2 _ 2JfXo — 2JJo + (^ 0 " + J 


or 


0 -/?") = o 


(5-25) 


and in polars „ r.o / 

r* — 2rro cos (9 — 0 o)+^o —■ • (5-26) 

The distinguishing feature of the equation of a circle in cartesian 

co-ordinates is that it contains ternis in jc*, y^ x, y, and a constant 

term, the coefficients of x^ and _>’* being equal. 

In the particularly simple case when the centre C is at the origin, 

the equations become 


+ j" = 


or 


r = R 


(5-27) 


The circle is such a familiar and simple curve that it is ^'fficult 
to deduce from the above equation any properties which are bot 
useful and not immediately obvious. We may note one Pr^ty 
equation (5.27) of interest. Up to this point most equations have been 
given expllitiy in the form “j = a funct on of .v’ ; a orm which is 
most convenient for plotting a graph. But equation (5.27) g‘ves a 

relation between a: and y which confines t*'*" f/',’ 

just as effectively as an explicit relation, y is then said to be ‘■"P' 
function’* of a:. It is true that we can m this case readily solve (5.27) 
to give J = ± general principle 

that almost any relation between a- and y will define a curve. 

Incidentally another moral to be drawn from the equation of he 

circle v = ± J\R^ — a*] is the necessity for the use of unending decimals, 

circiej . ± yf^, J . ir limited ourselves to exact fractional 
as explained in Chapter 2. n we iiiiuitu /.f v 

, * ^ iIa then for most values 01 x tnere 

values for x and y (such as 1/4, 27/43; ,, /.virt 

would be no valGe of y. since [R^ - v'] would 

square root. It would no longer be possible m f 

continuous curve. It might be possible to r jj^endins 

sufficiently ingenious dodges-but on the whole the use of unending 

decinrals gains much in simplicity and flexibility, even .1 they arc really 
only a convenient fiction. 


PROBLEMS 

(1) What is the equation of a circle with radius 25 and centre 

(o, -15)? What are the points of intersection ^ 

X and y axes, and with the lines x -- Jt ^ 2y> y 

(2) Show that in general a circle and straight line cut in two or 

no points, exceptionally in one. 

(3) A circle passes through the three points (0, 0), (5, 

(8, 4). By substituting these in the general equation (x x„) + 
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(y — its centre (a:o. >'o) and its radius R, Where does 

the circle cut the lines y — 4^: + 33^ = 20 ? 

(4) A circle passes through the three points (4-4, 12*3), (10*4, -3), 
( — 10-4, —iO‘i). What is its centre and its radius? 

(5) Show that the points Q = (fl, b) and R = {a^ —6) He on a circle 
with centre at O. Find the equation of this circle. If P = (a:, 3^) is any 
point on this circle, find tan ^QPP, and show that this does not change 
as P moves round the circle. 

5.15 The ellipse 

If a circle is squashed it becomes an oval curve called an “ellipse**. 
More precisely let us take a circle with centre O and radius a, and 
imagine it compressed in a vertical direction, so that the distance of 
each point from the A;>axis is reduced to i/cth part of its value, while 
the distance from the 3'-axis is unaltered: the resulting curve is an 
ellipse (Fig. 5.16). 



If P ~ (.V, 3’) is a point on the ellipse, and P' the point on the circle 
from which it was derived by compression, then by definition P'=(.v, cy), 
or UP ~ UP'fc in Fig. 5.16. Since P' lies on the circle of radius a, 
X and y must satisfy the condition + (cy)® = a-, or on dividing 
through by a-, 


where 


-f = I . 
b = ajc 


. (5.28) 

• (5-29) 
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(5.28) is therefore the equation of the ellipse in cartesian co-ordinates. 
In polar co-ordinates, since x = r cos 6 , y r sin 0 , it becomes 

/r cos^\^ /rsin 
(—) + = ■ 


From the equation (5.28) it is clear that if {x, y) is a point on the 
ellipse, so that x'^ja^ y^jb- = i, then (— x, (.v, y), and ( x, y) 

are also on the ellipse. In other words the curve is symmetrical about 
both the :x:-axis, or “major axis’’, and y-axis, or ‘ minor axis . It 
intersects the x-axis at the points A = (o, a) and A = (o, a), 

AA' = 2a is called the “(length of the) major axis’’. Similarly the 

curve cuts the minor axis at 5 = (o, b) and B' = (o, —b). 

From the symmetry of the equation we deduce that we can equally 
well look upon an ellipse as a circle of radius b expanded in the ratio 
c = ajb in the direction of the x-axis. (We always take the major axis 

to be the longer axis, so that a > Z», and c > 1.) 

A curve of this simple form may be expected to occur fairly often 
in natural objects. For example, the gravid uterus is approximately 
a “prolate spheroid’’, i.e. an ellipse rotated about its major axis, 
whereas a sea-urchin is roughly an “oblate spheroid , i.e. an ellipse 
rotated about its minor axis. Certain bivalve molluscs have elliptical 
lines of Bro\vth (d’Arcy W. Thompson’s Growth and Form p. 583) 
and muscles may also be elliptical (sec S. Haughton’s Animal Mechanics). 
In the physical sciences ellipses arc of even more frequent occurrence. 
A penny or other circular object looked at obliquely appears to e 
elliptical; a disturbed pendulum swings in a near ellipse; bridges and 
tunnels often have elliptical arches. Most of the planets and the sun 
are, in varying degree, oblate spheroids in shape, and the planets have 
very nearly elliptical paths relative to the sun. In biological statistics we 
have “ellipses of correlation” (Fig. 20.7) and “ellipses of discrimination 

(Section 21.11). 


Properties of the ellipse 

In Fig, 5.16 the angle /^.XOP' is called the "eccentric angle’’ ^ of 
the point P. (N.B. 'I’his is quite different from the polar co-ordinate 
d = /_XOP.) Since OP' OV a cos <^, = 

{a sin rfj)jc = b sin <^, and so P -- (« cos <P, b sm «/>). If P is the point 
on OP' for which OP ' h, then P " has y-co-ordinatc b sin the same 
as for P, so that P 'P is parallel to OX. 'I'his gives a construction for the 
point P with eccentric angle <f>'. draw the line OP / at inclination 
cutting the circles of radius b and a at P" and P' respectively, and draw 
P''P, PP' parallel to the axes to meet at P. By repeating this con¬ 
struction we can find as many points of the ellipse as we wish. 



96 


BIOMATHEMATICS 


PROBLEM 

(i) Verify the following alternative construction for an ellipse. Take 

a straight ruler HK of length a -h 6, and divided at the point P such 
that HP = a, PK = b. Let the end H be constrained to move along 
the y-axis, and K along the x-axis. Show that when Z_HKO == 

P is the point (a cos h sin ^), and so as the ruler HK. moves, P describes 
an ellipse. 

This construction shows us incidentally yet another way of repre¬ 
senting a curve. An ellipse is the path described by a point P = (a cos 
h sin 0 ) which moves as ^ is allowed to vary. This is called the **para- 
metric form” of the curve, in contrast to the “implicit” and “explicit” 
forms of the equation, x’^la^ + y^jb^ = i and y = ± ^ V(^ — x-ja^) 
respectively. ^ is the “parameter”. 

The property that an ellipse is a circle “pulled out” along its major 
axis makes it almost obvious that the section of a circular cylinder by 



Fig. 5.17—An ellipse as a section of a cylinder 
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a plane A (not at right angles or parallel to the axis) is an ellipse. Con¬ 
sider a cylinder of radius b intersected by this plane A, which passes 
through a point O on the axis of the cylinder and makes an angle a 
with the axis (Fig. 5.17). Complete the construction as follows: a 
plane {W'YWY') perpendicular to the axis meets the section plane A 
in the line Y'OY, and IV'OIV is the line perpendicular to Y'OY in 
this plane. X'OX is perpendicular to Y'OY in the cutting plane A. 
P is an arbitrary point on the curve of intersection of /I and the 
cylinder, PJ the perpendicular from P onto FOi . 'I he intersection 
of the plane W'YIVY' and the cylinder is a circle of radius b. Let P’ 
be the point on this circle such that PF' is parallel to the axis of the 

cylinder. „ 

Then referred to axes X'OX, Y’OY the co-ordinates of P are 

while referred to axes \V'0\V\ T'O T the co-ordinates 
of P' are JP' and OJ = y respectively. But ^P'PJ = a. so that 


X = yP = JP' cosec a. 

Since P' lies on a circle, it follows that the curve of intersection on 
which P lies is a circle of radius b expanded in the ratio cosec a in the 
direction of the :c-axis OX, i.c. it is an ellipse with major axis 

• (5-30) 


2a 


= 2b cosec a . 


and minor axis 2b [so that c = ajb - cosec a], , 

Now imagine a sphere wliich just fits mside the cylinder (and so 
has radius A) to be moved down the cylinder until it touches the plane 
of the ellipse at /•’. 'I he sphere will then have centre A say, and will 
touch the cylinder along a circle C which lies in a plane // perpendicular 
to the axis of the cylinder. Let G be the point o 6 which lies on the line 
PP\ Similarly by running an equal sphere down the cylinder from 
the other end, we can find a second point of contact/- , a centre Z , a 
circle C\ a plane //' and an intersection G". It is cieair from the symmet^ 
of Fig. 5.17 that /••, /■•■ lie on the major axis, that /• U y U/- Z U - OZ, 
and that PG touches at G the sphere with centre ("'c shall cave 
the reader to supply formal proofs if he wishes Now / /■ is also a 
tangent to the same'sphere from P', and since all tangents from P to 
the sphere are equal, we must have 

PP PG. 


In the same way PP' = PG', and so l>y addition 

PP -{ PP' PG t P(P 

^ the distance between 11 and TT 
. a constant independent of P. 

We can readily find the value of this constant by moving P along the 


H 



q8 biomathematics 

ellipse until it coincides with A, the end of the major axis. Then 

PF + PF' = AF AF' 

— AF + A'F (by symmetry) 

= AA' = 20 . . . - (5-31) 

This gives us a new definition of an ellipse: “an ellipse is the path 
of a point P which moves in such a way that the sum of its distances 
from two fixed points F and F' is constant”. F is called a “focus” of 
the ellipse; and thus it has two foci, F and F'. We can readily find the 
distance OF, for in the right-angled triangle OFZ, Z^FOZ = a and 
FZ = b, so that 

OF = b cot a . . . . (5.32) 

We can use this focal property of the ellipse for an alternative 
geometric construction. If we place two drawing pins at the two foci, 
F, F', and run round them a loop of string PFF' of appropriate length, 
then by keeping this loop taut by a pencil at P we can make P describe 
an ellipse. 

Now the plane TJ will cut the plane of the ellipse in a line L per¬ 
pendicular to O-Y and cutting it at a point D. This line L is called 
the “directrix ” of the ellipse corresponding to the focus F. Similarly 
there will be a second directrix L' corresponding to the focus F' 

(Fig. 5.17). 

Now produce JP to meet L' at right angles at a point H' (JP is 
parallel to OX'). 'Then /^H'G'P — 90" and /_G'PH' = a, so that 

PF’ = PG' = PH' cos a . . . (5 33) 

'I'his gives yet another definition of the ellipse. PF' is the distance 
of P from the focus F', and PH' the distance from the directrix L'. 
Accordingly “an ellipse is the path of a point P which moves in such 
a way that its distance PF' from a fixed point F' bears a constant ratio 
to the distance PH' from a fixed line L', 

PF'jPH' = € = const.” 

vSince c -- C{)s a we must add the condition that € < i. If « = i we 
have seen in Section 5.13 that this definition gives us a parabola, not 
an ellipse. 'Fhe constant e is called the “eccentricity” of the curve. 

Since from (5.30) sin a = b!o, we must have the connecting 
eipiation 

-^{bluy^= -r I . . (5.34) 

FURTHER PROBLEMS 

(2) Draw the ellipse with semi-axes -- 4, ^ — 3, and verify both 
from the figure and from its equation that it passes through the points 
( il: 2*4, ± 2-4) and (± 3-2, ± i-8). 
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(3) Show that the distance from F to (o, b) is a. Where are the 
foci of the ellipse of problem (2)? 

(4) Find the intersections K and K' of the ellipse x^ja- + y^lb"^ = i 
and the line y = I mx. Find the mid-point S of the line KK\ and 
show that it lies on the line y = — b^xja-m. 

Prove that in Fig. 5.17 

(5) OF' = a €. 

(6) OD = OD' = al€. 

(7) OZ = OZ' = a. 

(8) Prove the focus-directrix property algebraically. Take the 

focus F to be the point {ae, o), the directrix L to be .v = o/e, the point 
P to be (jc, y) and PH to be the perpendicular from P onto L. Write 
down the equation PF'^ = in terms of these co-ordinates, and 

show that it reduces to x^ja^ + y-jb^ = i. 

(9) Show that the focus-directrix property implies the relation 
PF + PF' = za. 

5 «z 7 Conic sections 

Now let us consider a plane section not of a cylinder but of a 
circular cone. A circular cone is defined in the following way. Take 
a fixed straight line VZZ' called the “axis”, and a fixed point V on 
the axis. Let a variable line VGG' through V move in such a way 
that the angle /_GVZ remains fixed (at, say, ^). 'I’hcn the line VGG' 
traces out a cone (Fig. 5.18 overleaf). 

Now consider the intersection of the cone with a plane A (making 
an angle a greater than /3 with the axis of the cone) (Fig. 5.18). Let 
P be any point on the curve of intersection. As in the discussion for 
a cylinder we can imagine two spheres touching both cone and plane A\ 
one has, say, centre at Z, touches the cone in a circle C and the plane A 
at a point F, and the other has centre Z', touches the cone in C and 
the plane at F'. Let the line VP cut C in G, C in G'. Then PF — PG, 
since both are tangents to the sphere; PF' = PG', and PF -\- PF' = 
PG -}- PG' == GG' = a constant. 'Thus by equation (5.31) the curve 
traced out by P, i.e. the intersection of A and the cone, is an ellipse 
with foci at F and F'. Furthermore let L be the intersection of A and 
the plane /7 containing the circle C. 'Then PF - PG 

= (distance of P from II) (sec a) 

— (distance of P from L) (sec a . cos ^). 

Thus L is the directrix corresponding to the focus F, and the eccentricity 
< = sec a , cos fi. 

So far we have obtained the same curve—an ellipse—whether it is 
a cylinder or a cone that we cut through. Hut by allowing the angle a 
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Fig. 5.18—An ellipse as section of a cone by a plane A 
The angle between A and the axis VZZ' is a 

between A. and the axis of the cone to decrease we can obtain new 
curves, and by the sphere construction we can see that they must all 
have the focus-directrix property with eccentricity € = sec a . cos 
(The change in shape can be watched by shining a conical beam of 
light from a torch onto a wall A.) As a decreases the ellipse becomes 
longer and the second focus F' recedes into the distance so that when 
u = /S there is only one focus F, the eccentricity e = sec a . cos = i, 
and the curve has therefore become a parabola. When a decreases 
still further we get a new cur\-e of eccentricity sec a . cos )S, greater 
than 1. 'This curve is known as a hyperbola. 

Now in geometiy a straight line is imagined as extending indefinitely 
in both directions, so that the complete cone, defined as the figure 
swept out by a line through the vertex V making a fixed angle with 
the axis, is really a “double cone’* consisting of two ordinary or “single 
cones” meeting at the vertex V (Fig. 5.19). When we talk of a cone 
we usually think of a single cone, which can be easily constructed in 
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paper or wood as a rigid model—but geometrically speaking it is the 
double cone which is the complete figure. Now when a < ^ the 
cutting plane A intersects both halves of the double cone, so that the 
hyperbola splits into two parts which are not joined together. We can 
fit a sphere touching A into each single cone (Fig. 5.19), so that there 
are two foci F and F', one inside each half of the curve, by considering 
tangents from a point P on the curve to these spheres, we can show 

that 

PF - PF' = 1! 2« . . . (5.35) 

where za is a constant (the distance between the circles of contact 
of the two spheres measured along the cone). 

We can find the equation of the hyperbola as follows (Fig. 5.20). 
Let F be a focus, L the corresponding directrix. Draw FD perpendicu¬ 
larly from F onto L and produce it to O where OF = t-OD, i.c. OD ^ = 
DFj{€^ — i). Take OF as ;t-axis, and a perpendicular line OT as 
>-axis. Denote the distance € y OI) by a, so that F has co-ordinates 
{a€, o) and L has equation jc ajc. If P is any point on the curve, 
then |P//|, the perpendicular distance of P from L, is given by 

l/v/i - k - («/^)!. 
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while PF^ — (.V — ae)- But the focus—directrix property shows 

that PF“— i.e. (.v —«c)-+>'"= €'-(.v—a/e)®, which reduces to 

.%-/a- ->•-/*-= I . . . (5.36) 

where b — a\ {er — i). 

From this equation we see that if (.v, lies on the curve so also 
do ( — A% 3 ), (.V, —3') and (—.v, — y); i.e. the curve is symmetrical about 
both the .v-axis, or “major axis”, and v-axis, or “minor axis”. The 
point O is called the “centre”. 'The curve meets the major axis at the 
“vertices” (o, —a), but does not cut the minor axis .v = o at all. 

.\ peculiarity of the hyperbola which is not shared by the ellipse 
or parabf)Ia is the existence of “asympttjtes”. The two lines 3.- = bx/a^ 
3 ' ~ ■ b\\<i are called “asymptotes” (frtjm the Greek asymptotos, not 
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coincident, not meeting). If P is a point moving along the hyperbola, 
then the further it gets from O the nearer it approaches an asymptote, 
although it never actually lies on the asymptote (Fig. The 

effect of this is that the further parts of the hyperbolic curve are 
practically straight and almost coincide with the asymptotes. 

To prove this, let P be a point on the curve. Without loss of 
generality we can take the case in which the co-ordinates of P, (a:, y), 
are both positive: the other cases will follow by symmetry. Draw PN 
vertically upwards to meet the asymptote y = bxja at A^. N will then 
be (jc, bxja), and the distance PN will be {bx/a —y) = b{xla yjb). 
But P lies on the hyperbola, so that 


{x^fa^ —y-lb^) = {xia + ylb)ixla - yjb) = 1 

whence {xla){xja — yjb) < i- On multiplying both sides by the 
positive number bajx we obtain PN = b {xja — yjb) < Aw/.v. Now as 
P moves away from O, x increases indefinitely in value, and ipa!^ can 
be made as small as we like by making x sufficiently large. I hat is, 
PN can be made as small as we choose by going far enough away 
along the hyperbola. On the other hand the asymptote docs not 
intersect the hyperbola, for on the asymptotes y^lb- o, 

whereas on the hyperbola x^ja"^ — y^lb- “ i- ^Ve can relate the ^inpe 
of the asymptotes to the eccentricity c of the hyperbola. Ify== Z_XON 
(Fig, 5.21) is the angle of inclination of the asymptote y == bxja, then 
tan y = bja. But b^ - - 1), i.e. (tan yY = (sin y)V(cos yY = 

- I. Therefore £= = [(sin yY 1 (cos y)-]/(cos yY -- i/(cos y)- -- 

(see yY, and ■ 

€ = sec y ♦ - • • 15 - 37 / 


Another useful property of the asymptotes is the following one. 
Let A/ be a point on one asymptote (y -- bxja), and M a point on 
the other (y = — bxja). Let MM' intersect the hyperliola in P and / 
then MP = P'M'. 'I'o prove this, let M (x, bx’a), M' = {x', -bx'ja). 
Let P be a movable point on the line MM , and put MRjMM — 
We wish to find what is the value of A when P lies on the hyperbola, 

i.e. coincides with P or P'. Now MR X.MM , RM — MM MR 
= (i — A) MM', so that R divides MM' in the ratio A//*, where fi 
= (i — A), 'rherefore, using Joachimsthal's formula (5.24), R is the 
point 

([x - Xx Xx'l b[x - Xx - Xx']/a) 


and lies on the hyperbola if 

[x — Xx -\- Xx'Y/a^ b'^ [x — Xx — Xx'Y/aVy^ i, 
i.e. if (4 jc.t 7«^)(A — A^) ' i. 

This quadratic equation has tw(j solutions, A, (corresponding to P) 
and A 2 (corresponding to P') - 
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y XX^ 

MP'^ X^MM\ 

But Ai + Ag = I, so that MP + MP' = MM', and 

MP = MM'—MP'= P'M'. 

This property' can be used to construct a hyperbola, if its asymptotes 
and one point P' on the curve are given. 

We draw any line through P' cutting the asymptotes at M and M', 
and find the point P where MP == P'M'. P is then also on the curve. 
By repeating the construction with different lines MP'M' we can find 
as many points as we like. 


PROBLEM 


(i) Draw the hyperbola with asymptotes= ± A.v passing through 

^ equation, and show that it passes through (± -J, 
zb fl) and {± ±1). What are its foci, eccentricity, and directrices? 

5.18 The rectangular hyperbola 

If ^7 = ■ the asymptotes of the hyperbola are the lines r = zb 
which are at right angles. T’hc cur\e is then known as a “rectangular 
hyperbola and has equation co-ordinates 

r~ (cos 0 )~ — r- (sin 0 )- = a~, 

that IS, by (5.13), r- cos 2^? = tj-. This equation takes a specially simple 
iorm il we take new co-ordinate axes parallel to the asymptotes, that 
IS, at an angle ol 45 to the old axes, 'llus change of a.xes is most 
easily done m polar co-ordinates, where it amounts simplv to increasing 
'' 45 • I bus it 0 IS the new value of the polar angle, Q = 0 45°, 

or 0 0 _ 4C5 equation of the hvperbola cos 20 — 

becomes r- cos 2 (© - - 45 ) ^ r- cos (2 0 — 90 ) = r- sin 20 = 

2r sm 0 . r cos 0 or if {X, Y) arc the co-ordinates in terms of the 

new axes, A 1 ' =. hi- --- constant. It is in this form that we previously 
encountered the equation (Section 3.9). 

5.19 Conics as a family 

We have seen above that the circle, ellipse, parabola and hyperbola 

c.in ull be considered as sections of a cone, and classified according to 

r (-ccentiicity c. 1 here remains <)ne other possible form of a 

conic section: it the cutting plane passes through the vertex we obtain 
a pair ot straight lines. 
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It can be shown that a conic section in any position in the plane 
always has a “second degree” equation, i.e. one of the form 

+ k 2 y ^ + *3^ + *4^ + + ^6 = 0 

where the ^’s are six constants, and k^, ^3 are not all zero. Con¬ 
versely any equation of this form represents a conic of some type. 
Unfortunately the proofs of these results are rather long, as also are 
the methods for finding the axes, foci, etc., of a conic given by such 
an equation, and we shall not discuss them here. See, for example, 
D. M. Y. Sommerville’s Analytical Conics (2nd edn., 1929, Bell). 



CHAPTER SIX 


LOGARITHMS 


6.1 The equiangular spiral 

Imagine a person P who walks in such a way as to keep a certain land¬ 
mark O always at a fixed angle 4 > to his right, then the path he traces out 
is called an “equiangular spiral S with centre O and angle 9S” (Fig. 6.1), 
In other words the spiral cuts every line through O at the same angle 



Fig. 6.1—An equiangular spiral with centre O and angle 

Owing to the structure of their eyes certain insects fly towards a 
light O in such an equiangular spiral, and not directly in a straight line. 

I he curv'cd markings to he found on shells are often equiangular spirals. 
'1 he straight line and circle are special cases of an equiangular spiral— 
for the straight line 0 -- o and for the circle <3^ = 90 . We shall how¬ 
ever ignore these special cases, and consider in what follows only true 
s{>irals lor which 4 > lies between o® and 90 . 

I ake one such spiral with centre O, and passing through the point I, 
where the distanc'c* OI is i unit ( 1 -ig. 6.1). Fet P be anv point on the 
spiral, write OP . r, lOP 0 , s(j that r and 0 are polar co-ordin¬ 
ates oi P. 'i hen we shall define the angle 0 to he the logarithm of r, and 
r to l>e the antilogarithm of 0 , and W’c shall write 

0 • log r; r - 
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antilog 0 . 


(6.1) 
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It follows immediately, on letting P take the position /, that 

o = log I; I = antilog o . . (6.2) 

In this definition of a logarithm we must be careful to distinguish 
angles 6 equal to 0° and 360^. If P is at /, then we say 8 = /_ lOP = o'’ 
If P runs along the spiral away from O until it has gone completely 
round O, and returned to a point on OI produced, then we shall say 
that 8 = 360®, and not 0°. If P again runs along the spiral until it has 
gone round O a second time, then we shall say that 8 — 720®. 

Since the radius r is always taken to be positive, it follows that only 
positive numbers have logarithms according to this definition. However 
It can be shown—-we shall not go into the proof here—that every positive 
number r has a logarithm, and every angle 8 has an antilogarithm. The 
reader can easily imagine that this is so from the general shape of the 
spiral, which winds round its centre O an infinite number of times. 

6*2 The addition theorem 

Now let IP be the part of the spiral lying between the radii OI and 
OP. Imagine the triangular figure lOP magnified in a constant ratio R, 
so that OI becomes OI' of length R, and OP becomes OP' of length rR, 
while the arc IP becomes an arc I'P' (Fig. 6.2). Since magnification 



Fig. 6.2—The addition law for logarithms 


does not alter angles, it follows that every line through O will cut I'P' 
at the same angle ^ as it cuts IP\ in other words I'P' is part of a spiral 
with centre O and angle Now let M be the point on the original spiral 
at distance OM — R from O. 'I'hen if we rotate the figure POP' about 
O until I' falls on M, the arc PP' will fall on an arc MN of the original 
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spiral, with ON = rR. The angles Z_ MON and /_ I’OP' will be 
equal, so that 

Z ION = Z lOM -h Z MON 
= Z lOM + Z lOP. 

But Z ION = log ONy by definition, = log ri?; Z lOM — log OM 
= log i?, and Z lOP — log r. Thus 

log rR = log 4 - log r . . (6.3) 

The logarithm of a product is the sum of the logarithms of its factors. 

Alternatively we can express this same fact in terms of antilogarithms. 
Let Z lOP = 6, /_ lOM — ©, then Z ION = 0 -f 0. Therefore we 
have r = antilog 6^ R= antilog 0 and rR — antilog (^ + ©), so that 

antilog 4- 0) = antilog $ . antilog 0 . (6.4) 

The antilog of a sum is the product of the antilogs. 

We can also use this fact to express the definition of an antilogarithm 
in a slightly different form. We have Z MON = Z lOP = while 
ONjOM = rRjR = r — antilog 6 . Thus if we take two radii to the 
spiral, OAf and ON, at angle 0 = Z MON apart, the ratio ONjOM 
is the antilogarithm of 6 . Expressed in this way as a ratio dependent 
on an angle the antilogarithm is seen to be analogous to a trigonometric 
ratio—a point we shall return to later. For example, corresponding 
to the formula antilog (a 4* / 9 ) = antilog a . antilog ^ we have the 
formula cos (a + ^) = cos a . cos ^ — sin a . sin /S, which begins in 
a similar way but has an additional term. 

6.3 Logarithms as aids to calculation 

One of the chief uses of equation (6.3) is of course the replacement 
of multiplication by addition. 

For example, suppose we want to multiply 1-23 X 2*34. We have 
log (1-23 X 2-34) = log 1-23 4- log 2-34, and—if we may anticipate 
Section 6.4 by using a table of 4-figure logarithms—we see that 

log 1*23 = 0899 
log 2 34 = -3692 

log (1-23 X 2-34) -4591 

whence 1-23 X 2-34 = antilog ‘4591 = 2*878 (from a table of anti- 
logarithms). By direct multiplication we find that 1-23 X 2*34 = 
2-8782, so that the answer is correct to four figures. 

Since multiplication is done by addition of logarithms, the reverse 
process of division will be equivalent to their subtraction. W’e can 
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prove this formally in this way: in equation (6.3) put Rr = Sy so that 
R = s/r. Then the equation becomes 

log s = log {sjr) + log r 

or on taking log r to the other side of the equation 

log {sjr) = log J — log r . . (6.5) 

The logarithm of the ratio of two numbers h the difference between 
their logarithms. 

Alternatively we can deduce this property directly from Fig. 6.2. 
We have seen that log (OA^/OM) = 6= /_MON = /_ION — 
Z. lOM = log OiV — log OM. Expressed in terms of antilogarithms 
the property becomes 

antilog (a — /3) = antilog a/antilog ^ . (6.6) 

The proof is left to the reader. Thus to find 2-34/1-23, we use the 
equation log (2-34/1-23) = log 2-34 — log 1-23: 

log 2-34 = -3692 
log 1-23 = -0899 

log (2*34 /i' 23) = ‘2793 

2-34/1-23 = antilog -2793 = 1-902 to four figures. In words, we sub¬ 
tract the logs and take the antilog of the difference. 

This method of multiplying and dividing is especially useful when 
we have many factors. In general we have 

log ■ ■ •) = log A + log B + log C I- ... 

— logtf — logZ> — ... (6.7) 

as follows from repealed application of formulas (6.3) and (6.5). For 
example, to prove that log {Alija) -- log A i- log li — log o, we pro¬ 
ceed as follows: 

log (ABja) ■' log Afi — log «... [by (6.5)] 

= log A 1 log H - log a . . [by (6.3)] 

e.g. log (1-44 X 1-25/1-20) = log 1-44+log 1-25—log 1-20 

= -1584 + -0969 — 0792 = -1761, 

whence 1-44 x 1-25/1-20 — antilog -1761 - 1-500. As particular cases 

of (6.7) we have log {A'^) = log (AA) = log A -j- log A = 2 log A; 

and log (A^) — log (AAA) = log 2I 4 log ^ + log 2I = 3 log A. 

In general 

log (A”) - ^ n . log A 


( 6 . 8 ) 
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or expressed in terms of antilogarithms, 

A** = antilog (n. log A) .. . (6.9) 

or putting log A = since then A = antilog a, 

(antilog a)" = antilog na . . . (6.10) 

This formula enables us to find squares, cubes, and higher powers 
of a number very quickly. To square a number, double the log and take 
the antilog of the result. E.g., (1-23)2 — antilog (2 log 1*23) = antilog 
(•1798) = 1-513 to 4 figures. (Actually 1-232 = 1-5129.) Since taking 
a square root is the operation inverse to squaring, one would expect that 
it could be done by halving the logarithm. This can be easily shown as 
follows: 

VA , ^ A 

Therefore logV^ + logV^ = log A 

or log VA = h log A . . . (6.11) 

or alternatively -^A = antilog (i log A). 

Thus = antilog (i log 1-513) = antilog -0899 = 1-230. 

Similarly a cube root can be evaluated by dividing the logarithm by 3, 
and taking the antilog. 

Furthermore since log i = o, log (i/A) = log i — log A 

= — log A ... . (6.12) 

T hus to find the reciprocal i/A of a number A, change the sign of the 
logarithm and take the antilogarithm. In the same way we have in 
general 

Iog(i/^») = — «log .4 . . . (6.13) 

Thus it is easy to evaluate an expression like AB- \/C/ai}^, where A, 

C, a, and 6 are known quantities; for 

log (AB- \ O/ab^) = log A -r 2 log B -h ^ log C — log a — 3 log A. 

6.4 Common logarithms 

So far we have given a very general definition of a logarithm in 
terms ot an equiangular spiral, and have not shown how we can calcu¬ 
late values of logarithms, and so prepare a table of logs for practical use. 
Before we do this we must first explain what is meant by a “system’* of 
logarithms. 

1 he logarithm of the distance OP, where P is a point on the spiral, 
is defined as ^ /OP. 

Now’ the numerical value of the logarithm obtained from this defin¬ 
ition depends on two factors, namely the unit of angle in terms of which 
^OP is measured, and the angle ^ of the spiral. If we alter these we 
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obtain a different system of logarithms, but so long as we stick throughout 
to a single system—any system will do—all the formulas we have given 
above are valid. 

The effect of changing the unit of angle will simply be to multiply 
the numerical values of all logarithms by a constant. Thus suppose we 
took a table of logarithms and doubled all the entries in it, we should 
still have a perfectly usable table, but it would be a different system. 
In fact, if we take the basic equation log AB = log A + log By and 
multiply throughout by any constant k, we obtain 

k log AB = k log A + k log B 

so that the equation will still remain true if all logarithms are multiplied 
by k. It is at first sight more difficult to judge the effect of changing the 
angle ^ of the spiral. In fact this has the same effect as changing the 
unit of angle: it multiplies all logarithms by a constant factor. But the 
simplest way of proving this fact involves calculus, and must be tempor¬ 
arily postponed. To summarize: there are many different “systems” 
of logarithms, corresponding to different choices of spirals and different 
units of angle in the definition of logarithms. But the only difference 
between two such systems is that one is a constant multiple of the other. 

Now a very useful system is that in which the unit of angle is adjusted 
so that the logarithm of lo is i, i.e. so that antilog i = lo. Such log¬ 
arithms are called “common” or “Briggsian” logarithms, as they were 
first calculated by Briggs (1561-1631). (If we have tables of logarithms 
to any other system, we can readily convert them into common logs by 
dividing them throughout by the value of log 10 in that system.) The 
importance of the Briggsian system is as follows: since log 10 = i, it 
follows that log 100 = log 10^ = 2, log 1000 = log lo'* = 3, and so on. 

Now consider the equations 

12-3 = 10 X 1-23 

123 = 100 X 1-23 

1230 = 1000 X 123 

In terms of logarithms these become 

log 12 3 I log 1-23 
log 123 = 2 -f- log 1-23 
log 1230 = 3 1 log 1-23 

or since log i-23 = 0899, we deduce at once that log 12 3 = 10899, 
log 123 = 2 0899, *230 = 3 0899. In short, each time we shift 

the decimal point one place to the right the logarithm is increased by i. 

Thus if we can construct a table of logarithms for all numbers 
between i and 10, then we can find from it without trouble the logar¬ 
ithms of all greater numbers, where the decimal point has been moved 
a certain number of places to the right. 'I'hc fractional part of the log¬ 
arithm, i.e. the part following the decimal point, sometimes called the 
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“mantissa”, is not affected by moving the decimal point, and is deter¬ 
mined from the tables. Since log i = o, and log lo = i, it is reasonable 
to suppose that if A is any number between i and lo, then log A lies 
benveen o and i. (This is indeed almost self-evident from the defin¬ 
ition, but a form of proof can be given by calculus methods.) Thus if 
A lies between lo and loo, then log A lies between i and 2, and has 
integral part equal to i, i.e. the figure before the decimal point in the 
logarithm is i. (The integral part of a logarithm is also sometimes called 
the “characteristic”.) If A lies between loo and looo the integral part 
is 2, and so on: i.e. in calculating log A we find the integral part as one 
less than the number of figures to the left of the decimal point in Ay and 

fractional part is derived from tables, ignoring the position of the 
decimal point. Thus log 1230 has integral part 4 — 1=3, and frac¬ 
tional part = log 1*23 = -0899, and is therefore 3-0899. 

We can now show how to find log A for any number A >1. As 
an example, we shall calculate log 2 to 3 places. 

Firstly we see that since i < 2 < 10, log 2 lies between o and i. 
Now calculate 2'°, thus: 2- — 4, 2^ = (2^)“ = 4“ = 16, 2® = (2^)“ = 
16^ = 256, 2^® = 2- X 2® - - 1024. From this we see that since 2'® lies 
between 10® == 1000 and io‘, log (2^®) — 10 log 2 must lie between 3 
and 4, and so log 2 must lie between -3 and -4. To find the next figure 
in log 2 we must first go one step further, and calculate 2^®®. This is 
not so formidable an undertaking as it sounds, since we only need to 
keep to about four-figure accuracy. Starting with 2'® = 1024 — 1-024 
X 10®, we have 2®® - --(2‘®)- = I 049 X 10® ; 2"*®-= (2®®)-= i-ioo X 10^-; 
280 ^ (2>o)j 1-210 X io®‘; 2*®® = 2^® X 2®® = 1-269 X 

therefore lies between 10®® and io®h Thus log 2*®® = 100 log 2 lies 
between 30 and 31, and log 2 between -30 and -31. Finally to get the 
third figure we calculate 2^®®®: for this it is enough to keep only 3 
figures in the calculations. 2^®® = (2'®®)-= 1-61 X 10®®; 2'*®® = 

(2-®®)^ = 2-59 X lo'-®; 2»«® -- 6-71 X io 2 *®, and 2'®®® = 2-®® X 2®®® 
— 1-08 X io®®h 'rhus 301 < log 2^®®® < 302, and -301 < log 2 < 
•302. If we kept more figures throughout wc could find log 2 to 5 places 
to be -30103: the process can be carried out expeditiously using a 
calculating machine. ('I'his process also enables us to calculate log .v to, 
say, 6 places using 4-place logarithms: it is only necessary to calculate 
.v’®" to 4 figures, take its logarithm, and divide by 100.) 

Alternative notation for common logarithms 

In any system of logarithms, the value of antilog i is known as the 
“base” of the system. '^I'hus common logarithms may be called 
“logs to base 10” and are sometimes written log,0-4. In this book 
we shall restrict the use of the notation “log . 4 ” to common log¬ 
arithms, except where otherwise stated, so there should be no danger 
of confusion. All the logarithms used in Section 6.3 were common 
logarithms. 
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PROBLEM 

(1) Use this method to calculate log 5 to 2 figures, and verify that 
log 5 + iog 2 = log 10 = I. 

6.5 Logarithms of fractions 

Since log i = o, log A is negative when A is less than i. For 
example, log '5 = log i = — log 2 = — -3010 to 4 figures. 

It is> however, usual to write this differently- We have already 
shown that the logarithms of 5, 50, 500, etc., do not differ in their frac¬ 
tional part. They are log 5 = *6990, log 50 = 1*6990, log 500 = 
2*6990. If we wish to write log *05 with the same fractional pi^t, we 
must say log *05 = log tctj = —2 + log 5 = —2 -H *6990 = 2*6990 
using the symbol 2 to mean — 2. In the same way, log 005 3-6990 

written with positive fractional part, and negative integral part, as is 
the custom. By writing logarithms in this way we can keep the con¬ 
vention that the fractional part of the logarithm can be derived directly 
from tables, independently of the position of the decimal point in the 

original number: e.g. 

log 516 = *7126 (from tables), 

whence log 51 *6 = ^*7126 
log *516 =1*7126 
log *0516 =5 2*7126 

If the first non-zero figure is lUc first after the point, then the integral 
part of the logarithm is I; if the first non-zero fignire is the secofid after 
the point, then the integral part is 2, and so on. Such a notation can be 
easily converted into the form log *516 - -*2874, if desired, for 1*7126 
= —14, .7,26 = — (i — *7126) - — *2874. Similarly log *0516 

2*7126= — 1*2874. 'I'his transformation is, however, rarely needed. 


E X A M P L K s 

(i) Find *123 X *0234 by logs. 

We have log *123 = ^ 0899 

log *0234 - 2*3692 


log (*123 X 0234) = 3 4591 


•123 X *0234 = antilog 3*4591 = *002878. In doing thc^ adibtion the 
fractions are added as usual, and the integral parts thus, i i- 2 3. 


• These could also be written us z 6990 and r-690O, usinti the inverted ili«its 
* and C for — 2 and —3 respectively, utcordirit' to llie notation su^^gested in C hapter 2... 


I 
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(2) Find •oi23/*234 by logs. 

We have log ‘0123 = 2*0899 

log *234 = 1*3692 


By subtraction, log (-OI23/-234) = 2*7207 

whence ■0123/ 234 = -05256. In doing the subtraction we first deal 
with the parts following the decimal point in the usual way, obtaining 
difference *7207 with i to carry. For the integral part to the left of the 
decimal point, i (carry) +2 — 1=2. 

(3) Find \/-i 23 by logs. We have log -123 = T 0899, whence 
'v/; 123 = ^ X 1*0899. The simplest way of halving I 0899 is to 
write it as (2 + 1*0899), whence i x T 0899 = ^ x 2 + i X 10899 
= I + *5449 = 1*5449, and taking the antilog, ^*123 = *3507. 

These operations with numbers which are partly negative, partly 
positive, can be considerably simplified by the more radical method of 
using Colson’s notation (Chapter 22). 

6.6 Logarithms of negative numbers 

Often wc wish to do calculations with negative numbers which would 
be very well suited for the use of logs, were it not for the difficulty that 
negative numbers have no logarithms. We can overcome this difficulty 
by this device: we use for the negative number (— x) the same logar¬ 
ithm as for the positive number .v, but mark it with an “N” to show that 
• logarithm ’ of a negative number. Thus, since 2 has the logar¬ 

ithm *3010, - 2 will have the “logarithm” N*3oio, according to this 
convention, and — 200 will have the logarithm N2*30io 

If then we wish to multiply 12-3 X (-2 34) x (-4 32) X 

(--32*1) we could set out the work thus: 

log 12*3 = 10899 

log (— 2*34) = N *3692 
log (— 4-32) = N *6355 
log (~ 32*1) = NT*5 o65 


Total = N3*6 oii 
A ntilog 2^ — 3991 

^ In adding the logarithms we are really multiplying the positive num- 
bers 12*3, 2 34, 4*32. and 3*21 to get the product 3991. But since three 
of the logarithms are marked with an X this shows that three of the 
numbers we wish to multiply are actually negative, so that the product 
of them IS also negative. I he total logarithm is therefore marked with 
an N, and Avhen we take its antilog we give it a minus sign. In general, 
if we add or subtract an odd number of N-logarithms, the result wdll be 
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an N-Iogarithm: but if two, or any even number of N-logarithms are 
added or subtracted the N’s will cancel and the result will be an ordinary 
logarithm. 

EXAMPLE 

(1) Multiply ( — 12-3) X (— -234) 

log (— 12*3) = N10899 
log {— -234) = NI-3692 

Total = 4591 

Antilog = 2878 

The product 2-878 is positive since it is the product of two negative 
numbers: and therefore its logarithm is not marked with an N —the 
two N’s cancel on addition. 

In the same way an N-logarithm multiplied by an odd number 
remains an N-logarithm; but if multiplied by an even number it 
becomes an ordinary logarithm. 

EXAMPLE 

(2) Find (— 1-23)*. 

This is positive, since the fourth power of any number is positive. 

Now log (— 1-23) = N 0899 

4 

log (- 1-230 == 3596 

Taking antilogs, (— i 23)* = 2 389 

Finally we must notice that since a negative number has no square 
root we cannot halve an N-Iogarithm. 

^•7 Addition and subtraction logarithms 

While logarithms are very convenient for multiplication and division, 
unfortunately they make addition and subtraction correspondingly 
difficult. Sometimes in doing a calculation by logs we may want to know 
the value of log (:r y), where we know’ log x and logy. For example, 

vve might want to calculate i we can readily find 

log a® = 2 log a and log = 2 log Z»; how’ are we then to find log 
(d^ 4-6*)? One method w’ould be this: to find log {.v -\-y) knowing 
log X and logy, first find x from log x by using the table of antilogarithms, 
also find y from log y. Addition gives us the value of {x 4 - y), and we 
then take its logarithm. 'I bis method is very clumsy, and requires no 
less than 2 references to a table of antilogarithms, and one to a table of 

logarithms. 
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It is possible to reduce the work to a single reference to a table, 
together with one subtraction and one addition, by the following device: 

log {x -f- y) = log [jc (i + y/x)] 

= log 4- log (i -|- y/x) 

= log + log (i + antilog [log>' — log x]). 

Let us put log a; — log y = m, a known quantity (since log x and log y 
are known). Then this equation becomes 

log {x +y) = log a; + log (i -f- antilog [— w]) 

Now the expression log (i -f- antilog [ — w]) is an expression whose 
value depends only on the value of m, i.e. it is a function of u. It is 
called the “addition logarithm of and (since there appears to be no 
generally recognized notation) we shall denote it by “adlog u**. Its 
value can be tabulated once and for all, and we can then say 

log (.V 4 - y) = log a: -f adlog u 

where u = log a: — logy. In the same way 

log (.V — y) = log a: + log (I — yjx) 

— log a: 4 - log (i — antilog [— w]) 

so that if we call — log (i — antilog [ — w]) the “subtraction logarithm 
of u*’ or “sublog h”, then 

log (a; — y) = log x — sublog u 

where u as before is log .v — log v. 

Several books ot tables give values of addition and subtraction log¬ 
arithms: a good table will be found in Milne-Thomson & Comrie’s 
Staridard Four-Jigiirc Mathematical Tables (1931, Macmillan). 


E X A M P L E 

(i) Given log .v - - -4771 and log y -3010, find log (.v 4 - 3 ')* 

ii ^ - log .V — log 3' = -1761 
From tables, adlog ti -- -2219 
Whence log (.v 4- y) --t log a: -- adlog u 

= 6990 


both adiiitioii and sulitraction logarithms can be brought under a 
single procedure in the following way. Subtraction can be looked upon 
as a ]iarticu!ar case ot addition of a negative number, so that (.v 4- 3') can 
be loulo d upon as (.v [ - v]). We can therefore consider the problem 

tpjiie gciierally as this: to find log (.v 4-3’) given log .v and log3’, where 
.V and V can be jiositivc or negative numbers. If at and v are positive both 

logarithms will l^e true logarithms; if not, one or both will be N-logar- 
ithms. 
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We shall suppose that log > logy (ignoring the N’s, if any). If not, 
X and y must be interchanged in what follows. First find u = log x — 
logy: u will then be a positive number, with possibly a prefix N. Look 
up the addition logarithm of w, adlog ti, in suitable tables. (There will 
be different tables according as u has or has not a prefix N; the table 
with the N prefix is equivalent to subtraction logarithms, but has the 
sign changed for convenience.) Finally calculate 

log (jc + y) = log ^ + adlog w . . (6.14) 


FURTHER EXAMPLES 

(2) Given that log x = 1-4771 and log y = 1-3010 find log (x + y) 
and log (x — y). 

M = 1-4771 — 1-3010 = -1701 

From tables adlog u = -2219. log (x + y) = i- 477 i + -2219 = 
1-6990. Also log (x -y) = log (x +y') w'here y' = - y, log y' = 

Ni-30io. 

ti' = log X — log y' 

= 1-4771 — Ni-3oio 
= N-1761 = N«. 

From tables adlog m' = — *477 * 

/. log (x — y) = log X 4- adlog ti 

= 1-4771 — -4771 = I-OOO. 

(3) Given that log x= 1-1234 and log y=^Ni-i22i, find log 
(x 4 - y). 

n = log X — log y 

= 1-1234 — Ni-i22i — N-OOI3 

From tables, adlog w - - 2-53, so that log (x 4 -y) = i-i2 - 2-53 

= — 1-41 

Note that adlog u is only given in the tables to 3 figures, so that log 
(x 4- y) has only 3-figure accuracy. We can sec the reason for this if we 

work out X and y directly. 

^ = antilog 1-1234= 1328 

y = antilog Ni-i22I — 13-24 

X -\- y ---- -04 

In the addition (x 4 - y) most of the digits cancel, and in fact (x 4 y) has 
only an accuracy of i significant figure, and log (x 4 y) is only accurate 
to 2 places. Accordingly to give any further figures in the addition 
logarithm would merely give a spurious appearance of accuracy. 

The use of logarithms as devices to simplify complicated calculations 
is now being largely superseded by the use of calculating machines. 
With modern machines, addition and subtraction logarithms become 
completely unnecessary. 
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6.8 Logarithmic comparisons 

In adult human beings height ranges roughly between 1-3 and 19 
metres, an overall range of *6 metres, or 60 cm. In flying-fish, however, 
the range of length in adults is roughly between 7 and 12 cm, a difference 
of 5 cm. [A. Fr. Bruun, Flying Fishes {Exocoetidae) of the Atlantic^ 
Dana Report No. 6 (1935), Carlsberg Foundation.] Thus the vari¬ 
ation in men is much greater than the total len^h of a flying fish. This, 
however, is scarcely a fair method of comparison. A more reasonable 
measure would be the percentage variation—or let us say the ratio of the 
greatest and least values. For man, the ratio of the greatest to the least 

is—- s= 1-46; for the fish it is — = 1*71. We can indeed rea- 

13 7 

sonably expect most species to have ranges of variation of this order 
of magnitude, when expressed as a percentage or ratio—although when 
expressed in absolute rather than relative units the ranges will be very 
different. 

Note: such an idea of a “range of variation”, though very convenient, 
is difficult to express precisely. If we limit ourselves to the normal adult 
population, then by far the greater proportion of heights will be between 
13 and 1-9 metres. But in a sufficiently large population there will 
certainly be a few giants and dwarfs outside these limits. Is it really fair 
to include these exceptional individuals who occur so very rarely? If 
one does not, where is the line to be drawn between those included and 
those e.xcluded? If all individuals are supposed included, how can one 
be sure that some exceptionally short or exceptionally tall person has 
not been overlooked? We shall return to this point later (Section 20.4); 
for the present a fairly vague definition of the range as including almost 
all the population will be sufficiently adequate. 

'I'his idea of “proportional range” can readily be e.xpressed in logar¬ 
ithmic form. For if .v is (say) the least value of some measured quantity 
in a population, and A' its greatest value, then log (A'/.v) = log X — 
log .V. If A'.'.v is, say, i -5 -that is, the greatest value exceeds the least by 
50 per cent, then log A' — log .v = log 1*5 — -176. But log X is the 
greatest value of the logarithm of the measurement in question, and 
log A- the least: so that log X — log .v is the range of the logarithm. For 
example among human beings very few exceed a height of 1-9 metres, 
which has a logarithm i-zS, and very few are shorter than 1*3 metres, 
with U)garithm i-i 1. d’he range of log heights is accordingly about -17. 

W’e can expect a very similar range for other characters both in man 
and in most other species- -although there will be a few special cases 
where the variation is exceptionally wide or exceptionally narrow. 

'There are a number of other cases in which it is better and simpler 
to consider the logarithm of a measurement or number instead of the 
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measurement itself. Take, for example, a colony of bacteria growing in 
a medium which provides them with an abundance of food. 1 hen each 
bacterium will split into two in its reproduction time of, say, i hour. In 
one hour the number of bacteria will be multiplied by 4, in i ^ hours by 
8, in 2 hours by 16, and so on. The growth will continue in this ex¬ 
tremely rapid fashion—a “geometric series”—until overcrowding or 



Fig 6 a_The theoretical growth curve for a bacterial population, 

’■* plotted logarithmically 



exhaustion of food causes it to slow down. It is difficult to form a clear 
mental picture of such a rapid growth, which midt,pl.es the colony a 
thousand-fold in less than 5 hours, and a million-fold in under 10 hours, 
and it is still more difficult to represent it adequately by a graph. But 
the logarithm of the number of bacteria has a steady rate of increase of 
log 2 — *3010 every half-hour, or 6020 per hour: and plotted against 

the time it will give a straight-line graph. (Fig. 
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A graph can often be made straight by plotting the logarithm of one 
measurement against the logarithm of the other, instead of using the 
original measurements as co-ordinates. Thus if we plot the logarithm 
of the weights of schoolgirls against their heights, using the data of 
Section 3.5, we obtain the graph of Fig. 6.4, which is practically straight 
except at the lower end. 

Other examples of a linear relation bet\veen logarithms are: 

(1) Dreyer’s relationship between the weight JV of a. person (in 
kilograms) and the sitting height A (in metres); 

log W = 1-9532 + 3 'I 3 S log ^ 

(2) Du Bois*s relationship between the surface area S of the body 
(in square metres), the height H (in metres), and the weight W (in kilo¬ 
grams) : 

log S = — *6937 H- -425 log W + -725 log H 

Linear equations of this kind connecting the logarithms of body 
measurements are known as “allometric” or “heterogenic’* relations. 
They are of frequent occurrence in connecting both the changes in 
body measurements as an organism grows to maturity, and also in 
relating the different values of these measurements among the separate 
individuals making up the adult population. Strictly speaking the 
relations will be between average values rather than individual values— 
we cannot expect each individual to obey the relation exactly: but that 
is a point we shall return to later (Section 21.10). 


6.9 Fractional and negative powers 

An important special case in which there is a linear relation between 
the logarithms of two variables .v and y is that in which one is propor¬ 
tional to some power of the other, say 

y = Kx^ 

whence 

log y = log K B log A- 

Conversely, if a and y .ire positive variables related by an equation of 
the form 

log 3' — A B log A 

then if i? is a positive integer we can write this as 

y — Kx^ 

vhere AT = antilog A. For since 


log y = ^ B log .r, 

y — antilog {A B log a) 

antilog A . antilog {B log x) [by (6.4)] 
— antilog A . a° [by (6.9)]. 
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But if B is not a positive integer, we cannot replace antilog {B log .v) 
by jc®. For a:® is usually defined as the product of B x's, and according 
to this definition there is no sense in talking about .r® unless 5 is a 
positive integer. Now this restriction is rather an awkward one, and so 
mathematicians find it convenient to replace the old definition by a new 
one: 

Definition : If x. is any positive number, and B any real number, then 
the symbol x® means antilog (B log x); i.e. 

log a:® = B log a: . . . {615) 

for any value whatever of B. 

The restriction of x to positive values is necessary because otherwise 
log X would be an N-logarithm, and not a proper logarithm, and we 
have not given any rule for multiplying an N-logarithm by any number 
B (other than an integer). In fact we have shown that an N-logarithm 
cannot be multiplied by -i. (There is one exception to this: we can 
use the above definition of x® when x is negative and B is an integer, 
and it will still agree with the ordinary meaning of x® if B is positive.) 

The first property we must verily in this new definition is that it 
does not conflict with the old one. If B is a positive integer, then both 
definitions are applicable, and equation (6.9) shows that both give the 
same answer. 'I'hus the new is simply an extension of the old. It is, 
moreover, a most useful extension, for, as we have seen, it enables us 
to express the general “ allometric ” or logarithmic straiglit-line relation 
log ^ ^ B log X in the compact form 

y = KX^ .... (6.16) 

where K = antilog A. [But the reader should note that equation (6.16) 
is really neither more nor less than a shorthand way of writing the 
relation log y = A + /i log x, and this may sometimes with advantage 
be written in full.] 

Now the symbol x®, using the old definition, was subject to certain 
rules of operation, such as x"". x" = x"’-‘". Is the new definition subject 
to the same rules? “^i he answer is yes: for example since 


and 


log .V” ^ m log X 

log x" = n log X 

by definition, we have on addition 

log X"* + log x" - (m -I «) log X 

or 

loK (x"‘. X") - log (x'"*") 

or on taking antilogs 


X’". X’' 


.V 


m 4 n 


■ (6-17) 
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quite generally, for all values of m and n. Again 


log x”* = m log X 
log = m log y 


whence by addition 


log (a;*" jv”*) = m log xy 

or 

x”* y”* == 

Finally 

log a:*”" = mn log x 


— m (log a:”) 
= log (.v")”* 

so that 



. (6.18) 


• (6.19) 


Thus the usual rules of operation with indices hold without alteration. 

It is also important to notice that although the new definition (6.15) 
of involves the use of logarithms, it does not matter which system 
of logarithms is used. All systems give the same final answer. For one 
system of logarithms differs from another only by a constant factor, 
so that if the equation log a*® — B log .v defines the value of .v® in one 
system of logarithms, then (multiplying both sides by a constant k) we 
have k log x^ ~ B. k log i.e. the equation remains true in any other 
system. 

The definition x^ = antilog {B log x) gives us a direct method of 
calculating x^ for any value of B. 

Query. What does this definition mean expressed in terms of an 
equiangular spiral? 


EXAMPLES 

(1) The formula C ~ *9705 relating the chest circumference C 

and sitting height A, both measured in metres, is a modification of a 
formula given by Dreyer. If A = 1-05, what is the value of C given by 
this formula ? I’he formula is equivalent to 

log C = log -9705 -f 1 • 1 44 log A = 1-9870 + I -144 log A . (6.20) 

If A = 1-05, then log A = -0212. Therefore log C = 1-9870 + -0243 
= -0113, C -- 1-027 metres. 

(2) Invert the formula of example (i) to express A in terms of C. 

^'his is best done by using the formula in its logarithmic form (6.20). 
We then have 1144 log A = log C — T-9870 

= log C + I — -9870 
= log C + -0130 
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Dividing through by i‘i44 we have 

log A = '8741 log C + 0114 
or 

A = (antilog -0114) C 
= I 027 C 

(3) Find (1024)-^ h 

log(-i024)= 1*0103 and therefore log(-io24 *'^) = —4*1 X 1-0103. 
This multiplication can be most readily performed as follows: 

— 4-1x1 = + 4 '* 

— 4-1 X 0103 = — 0422 


Total = — 41 X I 0103 
Therefore -1024“^'^ 


= + 4-0578 

= antilog 4 0578 
= 1*142 X io‘ 


PROBLEMS 

(0 Meeh’s formula for the surface area S of the human body, in 
square metres, is 5 = w'hcrc IV is the weight in kilograms, and AT 

is a constant which for children is *103. It has been found by Benedict 
and Talbot that if A is the length of an infant (in centimetres) the amount 
of heat produced by it in 24 hours is *1265 LKW^ calories. How much 
heat should an infant produce whose weight is 3'^^3 kilograms, and 
whose length is 52 cm. ? 

(2) Dreyer found the following relationship between W (weight in 
grams) and A (sitting height in centimetres) of a person: = 

•3803 A. Find the sitting height of a person weighing 89780 grams. 


Some important special cases 

By the definition we have = antilog (o log •*■) == * 

for all values of x (except x - o, since o has no logarithm). 1 his is to 
many people a somewhat surprising result. But the following argument 
shows that if the ordinary laws of indices are to hold then we must have 
*0 = I. Assume that .v"*. x" - for general values of m and n (as 

we know happens with our definition). 1 hen putting m o we have 

x’* = x'\ or .v** = I. . . 

Equation (6.11), \/x --- antilog (i log •*■). oan be written y^x — x . 
’I'his again is a natural consequence of the index law (.v ) == at , 

which on putting n =■ m =■ 2 becomes (-v^)^ - x, or x =■ y/x. In 

the same way we see that jc* is the cube root of -v, .v* is the fourth root, 
and so on. Since .v* {x^y^, we see that a* can be interpreted as the 

square of the cube root of x. Alternatively at* -= (a-)* = the cube root 

of X*. 
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Since log (I/at) — — log [equation (6.12)] it follows that = i/x. 
Similarly x-^ = [by the multiplication law (6.19)] = i/x^; 

jc-s = i/a- 3 ; and in general a;-” = ijx^. This gives a meaning to 
negative powers. Similarly = ijx^ = ij^x. 

In addition we have, since log 10 = i, 

antilog n = antilog {n log 10) 
s=s 10". 

More generally, in a system of logarithms to base b = antilog i we 
have antilog « = 6”, or « = log i”. This gives another definition of 
logarithms which is sometimes used: “n is the logarithm of x to base b 
if a; = b^”. The difficulty is that we cannot use this to define logarithms 
unless we have a definition of which does not involve logarithms, and 
such a definition is bound to be clumsy. 

6.10 The length of a circular arc 

If A and B are two points on a circle of centre O, then the length of 
the arc AB depends on the angle /_ AOB — 6 “subtended” by the arc 
AB at the centre O, and also on the radius r = OA of the circle (Fig. 
6.5). We can readily find a formula for the arc length in the following 
way. 



Let C, D, be points on the circle such that /_ BOC = /_ COD = B. 
Then since the sectors AOB, BOC and COD are all congruent, the arcs 
AB, BC and CD must be all equal. Now Z,AOC =16, arc AC = 
2 . arc AB‘, /_ AOD — -^0, arc AD = 3 . arc AB. If we double the 
angle 0 we double the arc, and if we treble the angle, we treble the arc. 
In general it is easy to see in this way that the arc is proportional to the 
subtended angle 0. Again if we double the radius of the circle, obtaining 
new points A’, B', where OA’ = zr = 2.0.,'l, OB' — ‘zr = 2.OS, then 
the sectors AOB, A'OB' are similar, and the arc A'B’ is double the arc 
AB. In general the length of the arc is proportional to the radius. 
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By combining these two facts we have the formula 

2txcAB~Hr6 .... (6.21) 

where // is a constant of proportionality. The value of H must depend 
on the units in which the angle 6 is measured. If B is in degrees, then 
we shall denote the constant by We shall show in the next section 

how we can calculate its value; to 6-figure accuracy it turns out that 

^360 = -0174533 

From this formula we can find the circumference of a complete circle, 
for that subtends an angle of 360° at the centre, whence 

circumference = Hgco • ^ • 3 ^® 

= 6-283i9r 

It is usual to call the circumference 2irr, so that it follows that n = 
3-14159 =: 180 Hseo- (The “2” in the formula “ 27 rr” seems to come in 
merely on traditional grounds: the fundamental number is really 277, 
rather than TT.) 

We can readily adapt these formulas to any unit of angle we like. 
Suppose we choose as unit i/wth part of the complete circle, so that a 
complete turn = 360^ = tn units. It follows that the length of an arc 
AB can be expressed in the form 

arc AB = H^rB 

where is the correct value of the constant H for these units. Faking 
the complete circle, tn units of angle, we have 

circumference = = 27 rr 

whence = 2Trjtn . . . (6.22) 

In this formula there is no need for tn to be an integer. 

Now for many purposes it is convenient to take a special unit of 
angle, called the “radian”, to make the constant //„ equal to i, and so 
simplify the formula for the length of an arc. From (6.23) it follows that 
we can achieve this by putting tn = 277, i.e. we take as our unit i/arr of 
a complete turn. 

I radian = i/arr turns 

(360/277) degrees 
= 57 2958 (i.e. just under 60'^). 

I complete turn = 277 radians = 6-28319 radians 

I = 277/360 radians 

- //300 radians = 0174533 radians. 

It also follows that if the angle B is measured in radian units, or in 
“circular measure” as it is often called, then the formula for the length 
of an arc becomes 

arc rB . . . . (6.23) 



126 


BIOMATHEMATICS 


In particular, H 8 = i radian, then the arc AB is r. Thus a radian can 
be defined as the angle subtended at the centre of a circle by an arc 
equal in length to the radius (Fig. 6.6) (whence the name radian = 

RADIUS ANgle). 



Fig. 5.6—The radian or circular unit of angle 

Arc AB = radius OA 

This definition however gives very little hint of the importance of 
a radian as unit of angle, which will become clearer when we come to 
calculus. We shall simply state here that it is so important that when¬ 
ever an angle is mentioned without any unit being specified, it is to be 
understood that it is measured in radians. Thus an angle means 
TT radians or i8o^, “Jtt” = 30°, and “sin 0” means the sine of 8 radians 
unless the contrary is stated. 

6.11 Functions of small angles 

For many purposes it is important to know the values of the trigono¬ 
metric functions sin 0, cos 8, tan^ and antilog 8 when 8 is very small. 
We shall first consider the functions sin 8 y cos 8 , and tan 8 . Draw the 
triangle OUP, with /_ POU = 0 , and a right angle at U (Fig. 6.7). Let 


P 



us call OP /■, OU .V, I 'P y as usual. Let us draw also the arc 
of the circle PI' through P meeting OU produced at I’. I'hen it is 
fairly evident from the figure that if 0 is small enough the straight line 
UP is practically iiulistinguishahle from the arc I 'P, which is of length 
Hrd (in general units: H - //3C0 if the angle is measured in degrees, 
and H i if the angle is measured in radians). 'I’hat is to say. 



. (6.24) 


UP ^ Hr 0 
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using the symbol for “is approximately equal to”.* Now sin 9 
— yjr by definition so that 

sin 9 i=siH 9 . . . (6.25) 

i.e. the sine of a small a?igle is nearly equal to that angle multiplied by H. 
We also see that .v = Ot/ = OK = r, so that 

cos 9 = xjr csi I 

tan 9 yjx Hr 9 jr = H 9 . . . (6.26) 

We have used above the rather vague term “is approximately equal 
to”, and the symbol cs to correspond. In fact we can express ourselves 
rather more precisely. We mean that the percentage error in taking 
sin 9 or tan 9 to be H 9 is small and the smaller 9 is, the smaller the 
percentage error will be; so the smaller 9 is, the nearer the ratios 
sin 9/9 and tan 919 will be to H. This is expressed mathematically in 
the form “sin 9 J 9 tends to // as 9 tends to o”, or in symbols 

sin 9 j 9 H as 0 -> o 

tan 919 -^ H as . . . (6.27) 

This fact can be verified from tables of sines and tangents, and at the 
same time the value of can be determined, and from it the value 
of 77 = 1807/3^0. 

Table 6.1—Sines and tangents of small angles 


6 (degrees) 

sin 0 

sin 01 $ 

tan 0 

tan 61 $ 

30 

•50000 

•01667 

■57735 

•01924 

20 

■34202 

•01710 

• 3^>397 

•01820 

10 

'I7365 

•01737 

•17633 

■01763 

5 

•08716 

•01743 

•08749 

•01750 

I 

•01745 

•01745 

•01746 

•01746 

•5 

•00873 

•01746 

•00873 

•01746 

•I 

•00175 

•0175 

•00175 

•0175 


• There does not seem to be any universally accepted symbol for approximate 
equality: the symbols ;5:, ===, are all used. 'I'lie symbol — is also used in 

certain cases, but this has’a special tecltnical meaning and should not be used indis¬ 
criminately, 

AlthouKh the above argument that the arc I'P and line (//* are practically identical 
»cemh plausible enouKh from a diuKram, some readers may feel that it falls short of the 
standard required of a mathematical argument. 'The answer to this objection is that 
we are here takinK ordinary Kcometric ideas, such as angles and lengths of arcs, for 
(granted, and developing their properties in a common-sense way: and from such a 
common-sense point of view the proof seems rejsonal>ly adequate. I'o make the proof 
absolutely watertight would mean a thorough-going analysis of ideas such as angle 
and length and would bring in complicjitions which are hardly appropriate to a first 
study of the subject, however important they may be later. 
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Thus when d is small, or less, sin ejd and tan efB are about 
•01746 to 4 places, and this must be the value of taking more 

accurate tables we can find -0174533, and in theory we can find 
-f^36o accurately as we wish, although this is not actually the most 
efficient way of calculating it. 

Again, in saying that cos 0 i, we mean that there is a very small 
percentage error in taking cos 6 to be i. In fact since 

(i — cos 9 ) (i + cos 0 ) = I — (cos 9 )^ = (sin 6 )^ 
we have gy 

' - ® = I + cos-fl 

and since i + cos 0 > i, it follows that 

I — cos 9 < (sin 9 )^ 

Now sin 0 is approximately {II 9 ), so that (i — cos ^), which is the error 
in taking cos 0 to be i, is less than about (i/ 0 )-, or (sin 9 )-: a very small 
number indeed, much smaller than sin 9 . This is borne out by the 
tables, which give cos i'' = *99985, and cos *1° = i-ooooo correct to 
5 places. Similarly sec 6 — i/cos 0 is nearly i, and cosec 9 and cot 6 
are jjH 9 to within a small percentage error. 

EXAMPLES 

(1) To find the height of a distant object, such as a tree, hill, tower, 
or chimney. If this has distance x, and subtends an angle 9 at the eye, 
then its height is jy = .v tan 0 , which is nearly enough xH 0 . Thus a tree 
with an apjiarent height of A® at looo metre distance has a height of 
h X 1000 X -01745 ~ ^'73 nictres. 

(2) A person of normal sight can read print which has an apparent 

height of iV . At 6 metres he can therefore read print of height 
600 i\t X 01745 = -873010. This is the principle of Snellen’s type 
for testing distant vision. The height of type just legible to a normal 
person at 6 metres is -873 cm or -35 inches. Print legible at 60 metres 
will accordingly have a height of 8-73 cm, and that legible at 18 metres 
a heiglit of 3 x -873 2-625 ^^n- ^ person who can just read at 6 

metres print which he should be able to read at 36 metres is said to have 
visual acuity 6'36, i.e. he can read print subtending an angle of 

X 13 — 2®. If we assume that distinguishing a man at a distance is 
comparable to reading a letter of type, then he can distinguish a man 
1-8 metres tall at a distance x metres such that 

I-So -- //3C0 i X .V 

whence .v ==: 200 metres. 

If the angle 0 is measured in radians our formulas become simply 

sin 9 tan 9 0 when 9 is small. 
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PROBLEMS 

(1) How far away can a normally-sighted person clearly distinguish 
a tower 10 metres high on a clear day? 

(2) If the moon’s diameter appears to subtend an angle of and 
the distance away is 3-8 x 10® metres, what is its actual diameter? 
What is the size of the smallest object one can expect to distinguish 
clearly on the moon? 

6.X2 Natural logarithms 

We shall now consider the value of antilog 0 for small 0 , including 
in our formulas the general system defined by an equiangular spiral of 
angle <f> and centre O. Common antilogarithms will be a special case. 
Let I be the point on the spiral such that OI = i, and P the point such 
that Z lOP = e (Fig. 6.8). Then OP = antilog B by definition. Draw 



Fig. 6.g_The antilogarithm of a small angle 0 

an arc of a circle with centre O and unit radius to meet OP at Z; then 
ZP = OP — OZ = antilog 0 ~ i. We therefore have a triangle/ZP 
(with two curved sides) such that the angle at P is </>, that at Z is 90 and 
that at I is 90 ^ - <t>. Also ZP = antilog 0 — i, and the arc IZ = H0. 
Now when 0 is small this triangle is small and becomes very nearly an 
ordinary straight-sided triangle. '1 hercforc when 6 is small 

ZP = IZ cot 
(antilog 0 — 1) = cot 

or 

antilog 0 1 - 0 . H cot 

It is usual to call i/(// cot <^) the “modulus” of the system of l^ogar- 
ithms, and denote it by the letter M (or /.). (I’his is therefore a different 
use of the word “modulus” from the meaning absolute value of 

Chapter 4.) So , 

antilog 0 ^ i ■\ • • • (6.28) 

when ^ is small. , 1 i- .* vr> u 

We can express this in a different way. Let the distance ZP be 

called m: then OP -- i 1 u antilog 0. so that 0 — log (i | u), and 

IZ ^ HO H log (1 ^ u). Now when u ZP is small, the triangle 


K 
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IZP is small, and has nearly straight sides and the angle 0 is small. The 
smaller u is the straighter the sides of the triangle, and the nearer 
IZjZP approaches the value tan tf>. Therefore IZfZP tends to tan <i> as 
u rends to o, i.e. H log (i -h u)jii tan ^ as m o. 

Dividing by /f, 

log (1 + «) _^ tw ^ ^ as . . (6.29) 

u H 

Thus the modulus M is the value which log (i + m)/w approaches 
when u becomes very small. We can use this definition to find the value 
of the modulus for any system of logarithms. For example, if we use a 
5-figure table of common logarithms we find the following values: 


Table 6.2—Values of log (i -f u), u 


u 

log (l + u ) 

log (i -1- m)/u 

1 

•30103 

-30103 

•2 

•07918 

•3954 

I 

•04139 

•4139 

■OS 

•021 19 

■4238 

•02 

•00860 

•430 

1 -OI 

■00432 

■432 


d'hus log (i ^ ii)hi is approaching a value M *43 as u grows 
smaller. In fact by using more accurate tables we can find M = *43429 
and AI~^ ■ 2-30259 to 5 places of decimals. Not only does each system 
of logarithms have its own modulus A/, but also no two different sys¬ 
tems can have the same modulus. For if log x is one system, then any 
other system is a constant mult!!;-‘i of this, sav k . log .v. Now by saying 
that the first system has modulus A/ we mean that when u is small, 
log (i 4- li) u diflers little trom ,1/, and in fact that as u approaches o, 
log (1 ; u) u approaches A/. 'I'he modulus of the second system /r.log x 

will accordingly be the value approached bv /f.log (1 4 r/)/M as u tends 
to o. and accordingly will be simply kM. Since k is not equal to i (for 
otherwise wc would be considering only one system of logarithms in¬ 
stead ot tw{) ditlerent systems) the modulus of the second system must 
be ditferent from that of the first. In fact it is just the modulus of the 
first .system nuilliplied by k. In particular if we divide all logarithms of 
any system by its modulus A/, we <;btain a new sy'stem of modulus i. 
d’his system was discovered by Napier in 1614 and is known as the 
system of “napicrian” (or “naperian”), “natural” or “hyperbolic” logar¬ 
ithms. 'I'he symbol In .v is now frequently used for the natural logarithm 
of .V, and that is the symbol we shall use here. (The reader is warned 
that this usage is not yet universal, and sometimes the natural logarithm 
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is written simply as log jc.) Accordingly this special system of “ natural 
logarithms” is defined by the equation In x = il/”* log x or 

log a: = Min a: . . . (6.30) 

Since the modulus of a system of logarithms is equal to tan 
natural logarithms can also be defined as the system for which H — i 
and ^ = 45°, i.e. that obtained from a 45® equiangular spiral when the 
angle 6 is measured in radians. The essential property distinguishing 
natural logarithms from other systems is that when u is small, 

In (i + m) M . . - (6.31) 

(or more precisely that In (i + u)fu -> i as o). 

Query'. What follows if we put .v = 10 in equation (6.30)? In 
general what is the relation between the base and modulus of a system of 
logarithms ? 

The antilogarithms corresponding to natural logs could be written 
antiln at; but in practice this notation is never used. If jy = In a*, we 
write X = exp y, the exponential function of y. exp y is therefore just a 
special kind of antilogarithm, the natural antilogarithm: from (6.28) 
it obeys the law 

exp y I -j- y 

when y is small. The number 

e = exp I 

is the number whose natural logarithm is i, i.e. it is the base of natural 
logarithms. (Another way of writing In .v is therefore log, x.) The letter 
is usually reserved in mathematical lorrnulas for this particular 
number, just as n always means the ratio 3'H^59 • • • of the circum¬ 
ference of a circle to its diameter. We can readily calculate its value by 
putting u = e in equation (6.30), giving log e = M, or e = antilog M. 
Since M = *4343 ... we have from tables of common antilogs e — 
2'7i 8. More accurately e = 2-71828182846 . . . 

A further consequence of (6.30) is that 

log (exp y) - - M In (exp y) 

My 

= y log e 

i.e. exp y = e*-, by definition (6.15) of <?■'. This accordingly gives 
us another way of writing the exponential function, and also a way of 
calculating its value from a table of common antilogs. 


PROBLEMS 

(i) Using common logarithms, find In 2, In 3, In 5. Verify that 

In 10 = In 2 4- In 5 = M"^ 
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(2) Using common antilogarithms, find e”^®. 

(3) Draw rough graphs of jy = In a: (say for o <; .sc < 5) andjy = «* 
(say for — 1*5 < a; < 1*5). 

To summarize, the natural logarithm and antilogarithm, In x and 
exp a: = e*, are simply slight modifications of the common logarithm 
and antilogarithm according to the formulas 

In AT = M~'^ log X 
e* = antilog iVfA;. 

But they are nevertheless functions of the greatest importance in every 
branch of mathematics, both pure and applied to biology, chemistry, 
physics and engineering. We shall simply remark here that we have 
already mentioned that functions of the type C* are important, e.g. in 
the gro\\th of bacteria. Now C* is by definition antilog (a: log C) in 
any system of logarithms; using natural logarithms, C* = e* = 
e®* where r = In C. 

The reader should note the general behaviour of the two functions 
(or of logarithms and antilogs in general). When x is small and positive, 
In X is large and negative. The logarithm increases rapidly, until when 
.V = I, In .V = o, and it thereafter becomes positive. Its rate of increase 
slows down, and after a time it increases only very slowly: when 
a; = 10,000,000,000, In x is only 23 o. The exponential function be¬ 
haves very differently. It is always positive, but for values of .v which are 
negative and not too small e* differs inappreciably from zero. For 
example, if a: < — 5, e' < 007, and if x < — 10, < 0001. When 

AT = o, e-' -- 1, and thereafter the function increases more and more 
rapidly; c.g. = 148, e^® ~ 20000. 

6.13 Hyperbolic functions 

We remarked in Section 6.2 tliat there is a certain analogy between 
antilogarithms and other trigonometric ratios. This analogy is made 
even more striking by the use of certain special “hyperbolic functions”. 
The ‘ hyperbolic cosine” of .v is defined to be ^(e* €~^) and is written 

cosh A', and the “hyperbolic sine” ot .v is defined to be i(e' — e~*) and 
is written sinh v. 

At first glance the most striking property of these functions is the 
apparent lack of any resemblance whatever to the ordinary sine and 
cosine. 1 he graphs ot these functions are shown in Fig. 6.9. Unlike 
the ordinary cosine and sine they have no periodicity or wave form. 
1 he cosh curve rather resembles a parabola, but curves upwards more 
rapidly. It is in tact the shape of a hanging chain, and is sometimes 
called a ‘ catenary” (from Latin catena = chain). Also unlike cos a;, 
which never exceeds i. cosh .v > i for all .r, as can be proved in this 
way. Since — i)- is a perfect square, it is always positive or zero, 
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while e~^, being an antilogarithm, is always positive. 'I’hcrefore 
(e'— 1)2 is always positive or zero, i.e. 

le-' [(e')^ ^ ze' } i] o 

Hut — e-^ ^ — e^, and e" — i, so that this 

reduces to 

ie' - I -f he-' > o. 

Adding 1 to each side of this inequality we obtain cosh jc > i. 'I'he 
graph of sinh a: on the other hand is a sinuous curve, taking all values 
positive and negative, and steadily increases from one end of the Ar-axis 
to the other. 
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By analogy with tan x = sin x/cos Xy etc., we can define four further 
hyperbolic functions 

tanh X = sinh Ar/cosh x = (e® — e“®)/(e* + e“®) 
coth X = cosh ;c/sinh x = (e* + e“*)/(e® — e”®) 

sech X = i/cosh a? = 2/(e* -f- 
cosech a: = i/sinh a; = 2/(e® — e”®) 


[The words “sinh” and “tanh” offer difficulties of pronunciation to 
anyone who is not Welsh or Indo-Chinese. There are two ways com¬ 
monly used to avoid these difficulties. One is to pronounce the h as 
“sh”, saying “sinsh” (with long i) and “tansh”. The other is to call 
them “shine” and “than”, with th as in “thank”.] 

The function tanh x is nearly — i when a; is large and negative, and 
nearly i when a: is large and positive. It changes over from one value to 
the other quite quickly near a? = o. The function i -f- tanh x accor¬ 
dingly stays for a long time near zero, suddenly increases quite rapidly 
to near 2, and then again becomes nearly constant. Apart from a scale 
factor this is just the law of behaviour of a species introduced in small 
numbers to a new and favourable habitat, e.g. a few bacteria intro¬ 
duced into broth. It is probable that the gromh law does in fact 
approximate to a (i -}- tanh .v) or “logistic” form in many cases (Section 
10.8). 

We shall leave the reader to examine the general behaviour of the 
functions coth, sech and cosech. 

So far there is little resemblance between sines and sinhs, or cosines 
and coshes. But it is possible to show that algebraically their properties 
have many analogies. For example, from the equations 


cosh .V -f- sinh .v — ^(e® 4- e~^) + ^(^® — e"*®) = 
cosh -v — sinh x = ^(e® -{- e~^) — ■J(e® — c“®) — «■** 
we sec that 


or 


(cosh .V 4 sinh .v) (cosh x — sinh x) = e^. e-*, 


(cosh .v)® — (sinh .v)“ = 1 , 


(6.32) 


riiis gives the first comparison with ordinary trigonometric functions, 
for which (cos .r)- (sin .v)- ~ i. We know that the point (a cos 
h sin <f>) lies lor all values of d> on the ellipse x^/a- y^jb^ — i. Equation 
(6.29) shows that the point (« cosh b sinh always lies on the hyper¬ 
bola .%•-/«-—-- 1, whence the name “hyperbolic functions”. 
( 1 his also suggests why the onlinary trigonometric functions should be 
periodic, and hyperbolic functions not. For an ellipse is a closed loop; 
if we go round it once we return to the starting point, and then every¬ 
thing repeats. But the hyperbola is not closed; it stretches away to 
infinity.) In fact, we can make the analog)^ a little closer by anticipating 
some results of the integral calculus. Draw the circle at- y- = i and 
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the rectangular hyperbola = i. Let P be a point on the circle, 

P' a point on the hyperbola, and I the point (1,0) (Fig. 6.10). The area 
of the sector lOP is proportional to the angle 6 = Z_IOP\ for clearly if 
we double this angle, we double the area, if we multiply the angle by 3, 
the area is multiplied by 3, and so on. Thus area lOP = kd, where k 
is a constant. Suppose 9 is measured in radians; then when 9 = 2ir the 
sector lOP becomes the whole circle of unit radius, and has area tt 
(Section 11.13). Thus tt = k.2TT, or or ^ = 2 area lOP. This 

can be taken as an alternative definition of the angle 9 = /_ lOP; P is 
then the point with co-ordinates x = cos 9 , y = sin 9 . By analog let 
us call twice the area between the hyperbolic arc IP' and the radii O/, 
OP’ the “hyperbolic angle” 9 \ then P' can be shown to have co-ordinates 
(cosh 9 ', sinh 9 ’). 




Fig. 6.X0— Ordinary and hyperbolic angles 


Another analogy comes from the formula for cosh (.v -f- y): 

cosh (jc + y) = i (€"■+ + £-*-■') 

= i {e'e'' + e-^ f-*') 

= ^ [(cosh X H- sinh x) (coshy -{ sinhy) 

-}- (cosh .r — sinh .v) (coshy — sinhy)] 

= cosh x . cosh y sinh x . sinh y 

on multiplying out and simplifying this expression. Compare tliis with 
the trigonometric relation 

cos {x + y) — cos X . cos y — sin x . sin y 

The hyperbolic formula has the more natural sign -I- l)el\\cen the 
two terms; but otherwise the two formulas are alike. 

Proceeding in this way we can investigate what formulas connecting 
hyperbolic functions correspf>nd tf* the trigonometric formulas of 
Chapter 5. We find the following remarkable analogies. {'I’he proofs 
are left as an exercise for the reader.) 
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COS o = sec O =: I 
cosh o — sech 0=1 

sin o = tan 0 = 0 
sinh o = tanh 0 = 0 

If X is small (and measured in radians): 

cos X sec X £ 2 =: 1 

cosh X sech x 1 
sin X tan x a: 
sinh X ~ tanh a* ==£ a: 
cosec a: ==£ cot x ^ ifx 
cosech a: coth x ^ ijx 

For all values of .v and y (in any units): 

(cos xY + (sin xY = I 
(cosh xY — (sinh xY = i 

(sec a;)- — (tan xY = 1 
(sech a:)" + (tanh a:)^ = i 

(cosec a ;)2 — (cot .v)- = i 

(coth a;)- — (cosech a;)- — i 

cos (— x) = cos X 
cosh (— x) = cosh X 
sin ( — x) == — sin x 
sinh (— x) = — sinh x 
tan (— x) = — tan x 
tanh (— x) = — tanh x 

cos (x 4- y) = cos x . cos y — sin x . sin y 

cosh (x -i- y) — cosh .v . cosh y + sinh x . sinh y 

sin (x -f- _v) = sin x . cos y -}- cos x . sin y 

sinh (x 3') - - sinh x . cosh y -h cosh x . sinh y 

tan (x -i- y) (tan x -f tan 3')/(i — tan x . tan y) 
tanii (.V + _v) = (tanh x y- tanh y)/(i + tanh x . tanh y) 

cot (x -i y) (cot X . cot y — i)/(cot x + cot y) 
coih (x + y) ^ (coth x . coth y- i)/(coth x + coth y) 

cos 2x — (cos x)- — (sin .v)- = 2 (cos x)® — i 
cosh 2x .= (cosh x)“ y (sinh x)^ = 2 (cosh x)^ — i 

sin 2x ~ 2 sin x . cos x 
sinh 2x = 2 sinh x . cosh x 


tan 2x 


2 tan .V 
— (tan x)“ 


I 
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t 2 tanh X 

tanh 2x =-;-;—— 

I + (tanh Ar )2 

cot 2x = } (cot X — tan x) 
coth 2X = ^ (coth X + tanh x) 

cos 2 ,X = ^ (cos Jf)® — 3 cos A* 
cosh 3A = 4 (cosh xy — 3 cosh x 

sin 3A = 3 sin a* — 4 (sin xy 
sinh 3A* = 3 sinh a + 4 (sinh xy 

cos (a — y) = cos A . cos y + sin a . sin y 

cosh (a — y) = cosh A . cosh y — sinh a . sinh v 

sin (a — y) = sin a . cos y — cos a . sin jv’ 

sinh (a — y) = sinh a . cosh y ~ cosh a . sinh y 

tan (a — y) = (tan a — tan 3’)/(t + >’) 

tanh (a — y) = (tanh a — tanh y)/{i — tanh a . tanh 3-) 

cot (a — j)') = (I + cot A . cot 3')/(cot y — cot a) 
coth (a — 3/) = (i — coth A . coth y) (coth a — coth j) 

cos A cos y = i cos (a j) -f | cos (a — v) 
cosh A cosh y = i cosh (a -{- y) + i cosh (a — j) 

sin A sin y = — \ cos (a +>')*+ ^ t:os (a — y) 
sinh A sinh y = i cosh (a + j) — i cosh (a — y) 

sin A cos y = — ^ sin (a + jk) ^ sin (a — v) 
sinh A cosh_y = J sinh (a ^') + i sinh (a — y) 

If t = tan ^A then 

cos A = (1 — + f") 

sin A = 2//(i 4 /-) 
tan A — 2//(i — /^) 

If / = tanh ^A then 

cosh A = (l -f- — /“) 

sinh A = 2//(i — t^) 
tanh A = 2//(i + t'^) 


6.14 Inverse functions 

It is convenient to have some way of writing “the angle whose cosine 
(or sine, or tangent, etc.) is y”. 

If is a given function of x, say y = /'(a), then a is said to be the 
“inverse function” of _>», and this is usually written x - J'~^(y)- I hus 
if y is the square of a, >» = a'^ then the inverse relation is the square 
root, A = 4- y-j. If y is the logarithm of .v, then a is the antilogarithm 
of^*. Ifjv = InA, A ^ exp^'. Ifj = + '). ± i). 
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If y = sin Xf we write x ~ sin”^ = the angle whose sine is y. If 
y ~ tanh Xy we write x = tanh“^. (Another notation occasionally 
used is arc cosy for cos-^y, arc siny for sin-^y, arg tanhy for tanh-*y.) 
We could write antilogy as log-*y, but that is rarely done. 

The reason for the notation F-^(y) is this. If ^ is a number, and y = 
Ax, then inversely =ylA = A^^y. Although in the relation y = 
F(x) the letter F represents a function, not a number, analogy suggests 
that the inverse relation should be written x = F“i(y). 

Note that even if in the relation y = F(x) for each value of x there 
is only one value of y it does not follow that for each value of y there is 
only one value of x. As a rule that is not true: if y = x^, then giveny, 
a: can have two possible values, + Vy and — If y = cos then, 
given y, a; can have an infinite number of values—e.g. cos-^ o = 90°, 
^ 7 ® » • though it is true that of these values only 90® and 270° 

represent geometrically distinct angles. All others differ from these by 
multiples of 360°* In such a case x is said to be a “many-valued” func¬ 
tion ofy. For example, if y = a-=, a = ±^y is a “nvo-valued” func¬ 
tion ofy, the separate values and — Vy being two “branches” of 

the function. 

The inverse hyperbolic functions have alternative expressions in 
terms of natural logarithms. For example, let .r = cosh-i y, so that 
y = cosh A = i (e* + e“'). Multiplying this equation through by 
we have 2e-y = + i, or - aye* + 1=0. This is a 

quadratic equation in e*, with solution 

= y ± V(y-- i), 

i.e. 

.V = cosh-‘y = In [y ± — i)]. 

In the same way by solving the equation y = sinh a = — e-^) 

we find that “ 

.V = sinh-»y = In jj- -l- V(y= + i)]- 

(We cannot have a^minus sign before the square root here because the 
quantity y \/(y*' — i) enclosed in square brackets would then be 
ncgalive, and so would have no logarithm.) 

■rhese formulas agree with the graphs of cosh .v and sinh .v. If 
' 7* ^ there are two points on the graph y = cosh .v having this parti¬ 
cular value of y. If y < i there are no such points. The formula 
cosh-‘ y = In L>- i _ I)] fails to give anv value if _ i < y < i 

because (y- — i), being negative, has no square root, while ify < — i 
the exj^essions y ± ^/(y2 >_ j) arc then negative and have no logar- 
^^1^1 L other hand there is just one point on the curve y = sinh x 
which has a given height 3-, so that the function sinh-i y exists for all 
values ofy and is a one-valued function, as is shown by the formula 

sinh-»y = In [_v 4- x/(y“ -H 1)]. 



LOGARITHMS 


139 


PROBLEMS 

(1) Find the values of cosh 2, sinh 2, tanh 2, cosh~^ 2, sinh“^ 2. 

(2) Express tanh“^ x in terms of the natural logarithm function. 

(3) Find the inverse function x = /“* {y) for the following functions 

y y ^ + 3 ^: (“) y = ^ + x-\ (iii) y = (iv) 

y = (i — x)l{i + x); (v) y = y'(i — x-). Which of these are single¬ 
valued ? 

(4) Show that 

sinh”^ a: -J-sinh”^ ^ = sinh“* [.v-\/(i + >'-) +>'%/(* 

« 

X -I- V 

(s) Show that tanh“* x + tanh“^ v = tanh~^- — . 

-r jr 1 -txy 



CHAPTER SEVEN 
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7.1 LfOgarithmic scales 

In drawing graphs we have so far always used a uniform scale of 
measurement along each axis. The actual length chosen as unit of 
measurement has been allowed to vary from one graph to another: but 

once the zero point or origin and unit have been fixed the whole scale is 
completely determined. 

For many purposes it is better to use scales with non-uniform 
graduations* As we shall see we can do many complicated calculations, 
such as the calculation of 

T = 1724 W H -725 c 

for given values of IF, and C, without any arithmetic whatever 
simply by the use of a “nomogram” consisting of suitably graduated 
lines, together with a straight-edge. (This formula relates the amount 

calories produced by a person in a day with the body weight 
ll in kilograms, the height H in centimetres, and C the number of 
calories produced per square metre of body surface per hour.) 

I.et us take a straight line L and choose a point O on it as zero point 

nr> V- ^ ''' shall call the true distance 

„ A : this can be measured by graduating the line OP uniformly. 
But if we graduate the line in such a way that the mark “.v” occurs at 
the point P such that OP = A" — log .v, then the line is said to be “logar¬ 
ithmically graduated” (Fig. 7.1). Thus the true distance along the line 

is then the logarithm of the number marked (e.g. the mark a: = 100 
occurs at distance A' = 2). 


0 

Fig. 7.1 



A logarithmic scale compared with a uniform graduation 

below the line 


It follows that by comparing a logarithmic scale with a uniform scale 
we can read oil at sight on the uniform scale (AT) the logarithm of the 
number (x) on the log scale; or inversely we can read a: = antilog X. 
On a logarithmic scale the points marked i, 10, 100, . . . occur at ^ual 
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distances apart. Hence we can represent an enormous range of values 
on such a scale. E.g. if we consider lengths, we find in living creatures 



Largest trees 

ufhales (largest animals) 
elephant 

man 


small mammals 
lar^e insects 


minute insects 

protozoa 


Oacteria 

limtl of light microscope 
smallest Iracilli 

Cacleriophagc, virus 
large molecules 


Fig. 7,2—Sizes of living creatures on a logarithmic scale 


a variation from about 5 x 10"“ metre for a bacteriophage, if that can 
he considered as living, to the greatest trees witli heights of just over 
10^ metres, or to the seaweed in the Sargasso Sea, sometimes perhaps 
<>f even greater length. I.ogarithmicallv (usinu' common logarithms) 
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this represents a variation merely from ^ = —7*3 to -PT = +2. Since 
the limit of vision with a light microscope is about | of a micron, or 
logarithmically —6*7, practically the whole of this range is covered by 
ordinary vision unaided or with the help of a microscope. Adult man 
occupies roughly the range from *2 to -3 (Fig. 7.2). Such a scale also 
shows more readily how living things compare in size with the physical 
universe as a whole. The diameter of atoms and very simple molecules 
comes in the range A = —9 to —10, while the size of an atomic 
nucleus is, rather surprisingly, of the order of —14. In the other 
direction, the earth’s radius has logarithm 6-8, the distance of the sun is 
11-2, the nearest stars at about 17 and the extra-galactic nebulas at 22. 
At somewhere round 25 we may be approaching the farthest limit of 
vision. Thus the whole scale of the universe from the smallest objects 
known to the greatest observable distances can be compressed logarith¬ 
mically into about 44 units (using common logarithms, or just over 
100 using natural logs), although such a comparison in ordinary 
unlogarithmic units utterly defeats the imagination, as it amounts to a 
ratio of lo*"* to i. 

7.2 Logarithmic graph paper 

Relations of the form y = iCC* occur fairly often in biology. (This 
relationship may also be written y = ATe®*, where c = In C.) If we 
have a set of observed values of x andy we may often ask whether .these 
are consistent with an equation y = ATe®*, and if they are, what are the 
numerical values of the constants K and c (or equivalently JC and 
C ^^)* 

ro take a concrete example, Dr (now Dame Harriette) Chick treated 
anthrax spores with 5 per cent phenol at 20*2° C and found after various 
times a; the number y of surviving bacteria: 


X (time in hours) 

1 

1 

0 

■5 1 

J *5 

1 

2-7 , 

5-95 

25-6 

j y (number of sur\'ivors) 

434 

1 

410 

1 

351 

1 

33 J 

i 

241 

28 

y logy . . . .| 

1 1 

264 1 

i 

2-6i 

1 

2-55 

1 

2-52 

1 

2-38 

I ‘45 


Now the equation y = KC^ is equivalent to 

log y = log a: H- .V log C 

or 

A -i-Bx 

where 

Y = log y, A = log K, and i? = log C = Me, 
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M being the modulus of common logarithms. If therefore we plot the 
values of log y = Y against x we should obtain a straight-line graph. 
We can do this by actually calculating values of Y as in the table above. 
Alternatively we can use graph paper in which the y axis is logarith¬ 
mically graduated, and plot the values of y directly. When a point is 
plotted with co-ordinate y according to the markings on the paper, its 
actual co-ordinate according to distance measured from the Ar-axis will 
^ = logy, so that we obtain the graph of Y against x, and can verify 
its straightness (in a rough way) without calculation (Fig. 7.3). The 



graph resulting from Dr Chick’s data docs in fact look straight, 
suggesting that apart from experimental error a law y = KC^ does fit 
the data reasonably well. 

It remains to determine the constants A and B in the equation 
T == -j- Bx. A can be interpreted as the value of Y when a; = o, 

and B as the slope of the straight line, 'i'he simplest way to determine 
them is to take two points on the line Y = A Bx with known co¬ 
ordinates X and y, and use them to provide two equations for the 
unknowns A and B. Here it is most logical to take the first and last 
points. At the first point a; = o, Y — 2-64 (— log 434 )« that A = 
2 64. At the last point a: = 25 6, K = log 28 = 1-45, so that 

1-45 = 2*64 + B . 25-6 

whence B = — 0465. Alternatively we can write the relation as 
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y = where K = antilog A = 434 and c — BJM — —‘107. Also 

C = antilog B = -898. 

It is clear that the constant K is merely equal to the value oiy when 
X ~ Oy i.e. the number of bacteria at the start of the experiment. It is 
therefore of no special significance outside the experiment in question. 
The second constant, whether expressed as J? or or C, is of more 
interest. It shows how rapidly anthrax bacteria are killed by phenol— 
at any rate as regards the culture used by Dr Chick, and under her 
conditions of concentration and temperature. We can interpret C more 
precisely as follows. At a time x there are KC^ bacteria living; one 
hour later the number is This is {KC^)C, so that the number of 

bacteria surviving at the end of an hour is a fixed proportion C of those 
present at the beginning. Here C =-898 = 89 8 per cent; about 
10 per cent are killed each hour. 

The method we used above for fitting the straight line was very 
crude: it only takes two points of the curve into account, although we 
can see by eye that the straight line joining these two points does in 
fact pass near the others. A more accurate method is the “method of 
linear regression” which will be discussed in Section 21.10. But the 
chief moral is that it is much easier in testing for a relation y = KC^ to 
plot log y against .v, and obtain a straight line, rather than to plot y 
against .v and obtain a curve. 

This device of reducing a graph to a straight line should be used 
whenever possible. Thus a relation of the form y = Kx^ becomes 
log y = log K B log .v. Then by plotting Y = log y against X — 
log .V we should obtain the straight-line graph Y = A BX^ where 
A — log K. If we have graph paper in which both axes are logarith¬ 
mically scaled the points can be plotted directly from the original values 
.V and y without the extra labour of finding the logarithms. K will then 
be the value ofy when a; — i, and B will be the slope of the straight line. 


EXAMPLE 

(^) he following are the pulse rates y of persons (adult and chil¬ 
dren) of different heights x (metres). 


I Icight .V 

•50 ; 

1 ’ 1 

•698 

! 

-796 

•S67 

-986 ; 

1 

1-765 

1 

■ Pulse rate v 

134 ; 

1 

III 

1 

108 

! 

104 

i 

• 

98 

i 

• 

■ 73 


On plotting these points logarithmically (Fig. 7.4) we obtain a straight 
line of slope — i which cuts the line .v = i at y = 94, i.e. B = — |, 
K = 94, and the equation isy = 94X'-^. 



GRAPHICAL AIDS TO CALCULATION 


145 



PROBLEMS 

(i) Dr Chick tested the bactericidal value of 5 per cent phenol at 
33‘3° C by mixing a number of bacterial spores with the disinfectant 
and estimating the number of bacteria in one drop at various intervals 
of time. She obtained the following results: 


Time in hours, t . . 

0 

■5 

1-25 

' 2 

1 

3 1 

1 

! 4-1 

5 

7 i 

* 

Number of bacteria, n 

1 

439 

275-5 ' 

1 

*37-5 

' 4 f> 

00 

5-5 ! 

' 3-5 j 

1 

•5 

j 


By plotting log n against t show that the rate of killing obeys the 
law n = number of survivors — Ke'^. Kstimate the constants K and c. 


(2) The following values have been found by Feldman and Clark 
for the rate r of a rabbit’s heart at various temperatures t (degrees 

Centigrade). 


1 

Temperature t 

'4 ! 

5-f» 

64 

1 

1 

1 2-8 ' 

13 6 j 

1 

140 

1 

160 

1 

Heart rate r 

5-9 

11-7 

123 

1 

25 9 , 

284 

297 

37-8 

1 


Are they consistent with a law of the form r 

L 



BIOMATHEMATICS 


146 

(3) The following figures represent the relationship between the 
weight JV of a child in kilograms and its area S in square metres: 


w 

2 

f 

3 

1 

1 

1 

4 1 

5 j 

6 

1 

7 

8 

1 

9 

10 

s 

-163 -214 

•260 

1 

■301 

•340 

1 

' -377 

' -412 

•446 

■479 


Plot log S against log IV: what law do we find connecting IV and S? 
How much milk will an infant of 2*3 kilograms require per day if the 
amount of heat lost by the body is 1700 calories per square metre per 
day, and the calorific value of milk is 736 calories per litre ? 

(4) The area of a wound was determined every 4 days by drawing 
the outline of the wound on a sheet of transparent cellophane. The 
following results were obtained: 


Time (days) 

0 

1 

4 

1 

8 

12 

1 

1 

16 ! 

20 

24 

1 

00 

1 

32 

36 

Area (cm-) . .! 

1 1 

' 1 

107 1 

1 

1 

: 88 

1 

74-2 

1 

'6i-8 

1 

■ 51 i 

416 

33-6 

1 1 

26-9 

213 

4 

i6'8 


Find the law connecting the area with the time, and find how many 
days should elapse before the wound is reduced to i square centimetre. 

Note-: Carrel, Hartmann, Lecomte du Nouy and others (J. Exp, 
Med., 24, 1916, and 27, 191S) have shown that the logarithm of the area 
ot a wound generally decreases linearly with the time. The rates of 
decrease of the logarithm, using different antiseptics and different 
dressings, can thus he compared. A marked deviation from the straight- 
line law indicates probable infection. 


7*3 Other non-uniform scales 

Next to the iniiforni scale, the l>\garithmic scale is the most conven¬ 
ient aiul \ aluable. Hut clearly it is only one cjf an immense variety of 
possible scales. We can have square-root scales, square scales, reci¬ 
procal .scales, and many others. IVIany ot these will be useful for special 
problems. In general we .shall say that a scale corresponds to a function 
^(.■v) if the point P whieh is marked “.v” on the scale is actually at a 
distance OP - /'(.v) .A from the origin O. Such scales can be used 
in very much the same way as logarithmic scales. But, except in a few 
cases, specially graduated graph paper is not available, and it is neces¬ 
sary to calculate the % alue t)t ..V separately for each observed value of .v. 
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We shall illustrate by an example in which a square root scale is 
used. Sjdquist studied the course of pepsin digestion by measuring the 
electrical conductivity jy of the protein solution. He found the following 
values of y for various times a- (in hours): 


y 

0 

— 1 

10-5 1 

164 

19-9 

22*7 

24-0 

27-0 

30-4 

337 

X 

0 

2 

4 

6 1 

i 

8 

9 

12 

16 

1 

20 

X = 

0 

1-41 

2 

2-45 

2-83 

3 

3 46 

4 

4*47 


If we plot jy graphically against X — \/x we find that the points lie 
nearly on a straight line passing through the origin (Fig. 7.5). That is 



Pig* 7*5—Digestion of pepsin 

to say the points obey a law of the formy = BX - - B\/x. 'I’he constant 
B is not far from 8 (when A' = 3, y - 24, yjX -■ 8). 'I’his equation 
y == B\/x is the Schiitz-Horisolf law. 

This example also shows the danger of extrapolating an observed 
law. Clearly the law y — B\''x cannot continue indefinitely, since it 
would imply that the conductivity y would grow to an indefinitely large 
value, whereas of course it must approach a certain limiting value L 
corresponding to complete digestion, 'i'he true equation is kx = 

Bin L ~ l,\n(L — y) — y, but this approximates to the lawy - By/x 
for small values of x, as can be shown by series methods (Chapter 13). 
Each problem will suggest its own most suital)le method of scaling. 
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If we were testing the equation for refraction, sin i = p. sin r, where i is 
the angle of incidence and r the angle of refraction, it would be most 
natural to plot sin i against sin r to see if this gives a straight line. To 
test the lens formula ijv — i/m = iff where v = distance of object, 
u — distance of image, and / = focal length, it would be natural to 
plot ijv against ifu. 

This technique can be used to find the “order” of an irreversible 
chemical reaction, i.e. the number of molecules taking part. Let C be 
the concentration of a reacting substance, t the time, and 71 the order of 
the reaction. Then if w = i, i.e. the reaction consists of the decompo¬ 
sition of a single molecule, we obtain a straight-line graph by plotting 
log C against t . (See Section 16.23 for ^ detailed discussion of this 
point.) If « > I, then we must plot against t to obtain a straight 
line. A few trial plots for different values of « will usually determine 
the correct value. 



EXAMPLE 

(i) Alaclsen and \\ album studied the decomposition of tetanolysin 
by means ol peptone, obtaining the following results: 


: Time in hours t 

\ 

> 

■5 ' 

1 

I 

1 

2 ^ 

1 

1 

4 

> 

6 

1 

i s 

Concentration of 
peptone C . . 

« 

1 

47-7 ' 

1 

' 1 

' 39-7 

1 

; 30-3 

22-3 

1 

1 

i8*i 

170 

c-^ . 

1 

•021 

' -025 

1 

i -033 ! 

■045 ' 

1 1 

■055 

•059 

! 
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If we plot C~^ against t we obtain roughly a straight line (Fig 7 6) 
whereas the plot of log C or of C-2 or C-^ against / is curved. This 
suggests that the reaction is bimolecular (C*-" = when n = 2). 
[More elaborate in estigation indicates the scheme— 

2 molecules of tetanolysin 3 molecules of peptone lysinpeptone; 

the lysinpeptone immediately disintegrates with re-formation of pep¬ 
tone. Thus the peptone is not exhausted.] 

PROBLEMS 

(i) Madsen and Walbum studied the process of tryptic digestion by 
subjecting 10 grams of casein powder to the action of 100 cc of a i per 
cent solution of tripsin at constant temperature. The amount of casein 
remaining was found by Kjeldahl’s method of nitrogen determination. 
The following figures represent their results: show that they agree with 
the supposition that the process is a bimolecular reaction. 


Time 

(hours) 

0 

■5 

25 

1 

6 j 

! 

1 

11 

1 

33 

48 

72 

Nitrogen ' 
concentra¬ 
tion 

•I 10 

1 

1 1 

■108 

•102 

1 

1 

1 

) 

•100 

1 

i 

■096 j 

■076 j 

•070 

•060 

■049 


(2) Madsen and Famuicner found the following results for the con¬ 
centration of vibriolysin at 28'’ C. Find the order of the reaction. 


Time (minutes) 

0 

10 

1 20 

30 

■ 40 

5° 

60 

Concentration 

4 

0 

0 

4 

0 

78-3 

676 

1 

59'3 ' 

49-8 

' 40 8 

34'4 


(3) The following values have been found for x and y: 


X 

0 

1 

2 

1 

3 

4 

4 

I 5 

6 

7 

y 

6-29 

1 

5-72 

; 5-22 

1 1 

4-78 

435 

' 4 06 

375 

00 


By plotting xy against y show that these fit reasonably well to a 
law y = cil{x + b), and find the values of a and b. 
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By plotting log y against x show that they fit reasonably well to a 
law j =s and find K and B. 

This again points the moral that two quite different formulas can be 
in good agreement over a certain rangCy though outside this range they 
may behave very differently. Compare the values of y given by the two 
formulas when x takes values near — b. 

7.4 Slide rules 

If we have two ordinary uniformly graduated rulers we can readily 
add and subtract in the following way. 

Let Oi be the zero point on one ruler, and P, a typical point 
==■ 4, say, on the scale. This means that the distance O^Px is units. 
Now place the second scale alongside the first with its zero point Og 
opposite Pj (Fig 7.7). Let be a typical point on the second scale 



Fig. 7.7—Slide rule for addition and subtraction 


“A'2” = 5 (in the figure), so that the distance OgPg is X^ units. It 
follows that the distance O^P.;. = O^Pi -f- O2P2 = AT, + ATg. Thus 
opposite P2 on the lower scale will occur the point AT, + ATo on the 
upper scale. If we set P, at the point A', = 4, we see from Fig. 7.7 
that 44-1 = 5 (opposite i on the lower scale is 5 on the upper), 
4 !- 2 “ 6, 4 r 3 = 7, and so on. 


Suluraction is equally simple: it is just the inverse of addition. If 
{opposite a point P^ marked on the upper scale we place the point Pg 
marked A o on the lower scale, then opposite Oo on the lower scale we 
shall find P, on the upper indicating (A'g — A'o). For example, 9 — 5 
4 {Fiu. 7.7). For 0 ,Pi - — O P. ^"A'g — X.. 

NaUirally one would not in practice use such a device merely for 
addition and subtraction: it would be too cumbersome. But if we 


gratiuate the scales non-uniformly then this becomes a veiy’ powerful 
instrument, l or example, it we graduate logarithmically, so that the 
point Pi is marked .v, when it is at a distance OjPj - : A’l = log .v, from 
O,, then, instead of adding, the slide rule multiplies, and instead of 
subtracting it divides (Fig. 7.S). For now P, is marked “.Vi”, where 
OiP, --- log Xi, P.. is marked ".v.,” where OP. = log Xg, and P3 
opposite P., is marked “.V3” ( 12) where 6,P3 = log Since 
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^1^3 = OjP, + OgPo, we have log = log .y, -f log x.^y i.e. x^ = 
^i^2» *^1 = x^/xn- Thus in Fig. 7.8 the mark “i” on the lower scale is 
opposite “3” on the upper scale, and the rule is in a position to multiply 
by 3. Opposite 2 on the lower scale is 6 on the upper, so that 2 x 3 = 6; 
similarly 3 X 3 = 9, 4 x 3 = 12, and so on. Alternatively the same 
setting does the division 12/4 = 3. 


0 , i 

1 

__1_ 

J 5 7! 

4 1 6 1 8 

1 1 1 1 1 

P. 

? tl i 15 !5 

10 \?. 14 

1 J_ 1 



1 

c 

1 

2 

1 : 

f 1 I 1 M rrr 

.r-4 1 6 1 8 1 10 1 1?M5 

i " 5 7 $ 11 15 15 

P. 


Fig. 7.8—Slide rule for multiplication and division 


A pair of logarithmically graduated rules like this forms the simplest 
form of logarithmic slide rule, and can be used for multiplication and 
division. It is specially convenient for doing a scries of multiplications 
and divisions, such as (2 73 x ioo 6)/(230 x 5f>'2). Most modern slide 
rules have a number of additional scales which enable one to find square 
roots, squares, cubes, reciprocals, natural logarithms, e.xponetuials, 
ordinary logarithms, powers (a "), sines, cosines, and tangents, with the 
minimum of trouble. The exact details vary with the make of slide 
rule, and we shall not discuss them here, since they introduce very little 
that is new in principle. 

It is, however, worth noting that it is a fairly simple matter to make 
slide rules with graduations on other functional scales l^esides the famil¬ 
iar logarithmic one. If we use a square graduatioji the rule will solve 

x'^y i.e. compute the sides of riglit-angled triangles, and 
also calculate “sums of squares”, an important statistical operation. In 
such a graduation the mark x must be placed at a distance .v- from the 
2ero point. By graduating by reciprocals we can solve the letis equation 
ifv — ifu = i//. 


7‘5 Nomograms 

Instead of using a sliding scale ff>r additif)n we can often use fixed 
scales and a straight edge as indicated in I'ig. 7.9 overleaf. Here we 
have three parallel scales which we may call the j,, y.. and y^ sc.ilcs. 
They arc drawn vertically for convenience in use; and since a vertical 
scale is usually labelled “y” as a matter of convention, that is our reason 
for the use of the letter j here. If nf)W the pf)inl tnarked y, ( -2) on 
the first scale is joined by a straight liiie to tlie poiiu nuirked y.^ ( 3) 

on the third then this line will cut the central scale at the point 
y* (A proof of this property will be given in the next section.) 
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Scale of 

y^ 



Scale of 





Fig. 7.9—Nomogram for addition 



Fig. 7.10—Nomogram for mxUtiplication 
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The dotted line shows the addition 2 + 5 = 7, or alternatively the sub¬ 
traction 7 — 2 = 5. By replacing the uniformly graduated scales by 
logarithmic ones we shall get multiplication instead of addition as in 
Fig. 7.10, where the connecting line shows the multiplication 2x4 
=8, or alternatively the division 8/2 = 4. Charts such as these in which 
a calculation is performed by joining suitable points by straight lines 
are known as “nomograms” or “alignment charts”. 

We shall consider here only the simplest kinds of nomograms in 
which all scales are vertical and parallel. This touches only the fringe 
of the subject, for it is clear that if we allow non-parallel and curved 
scales the possibilities are immensely widened. But for further details 
we must refer readers to books on the subject, such as S. Brodetsky, 
First Course in Nomography^ Bell, 1925, or A. S. Levens, Nomography, 
Wiley, 1948. 



Fig. 7 #ii—Principle of a parallcl-scalc nomogram 

7«6 Nomograms with three parallel scales 

Fhe fundamental theorem of mnnography is this. 

three parallel straight lines, and O,, Oj and 
be three points on and L, respectively lying on a straight line. 
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(As a rule O2 and O, will lie in the same horizontal line; but for 
the sake of generality we shall not assume that to be so.) Let Pj, Pg, 
Pg be the three points in which a straight line cuts Lj, and Lg 
respectively. Call the actual distances OiP,, 0 ,P,, Fi, Fa 

and Yg respectively. Furthermore, let the distance between and Lg 
(measured positively if Lg is to the left of Lj, negatively otherwise) bear 
to the distance between L, and Lg a ratio A : ^ (Fig. 7.11). 

Then 

Fa = (AFa +/xFj)/(A -f//) . . (7.1) 

Proof. Draw 2 Parallel to OjO^Oa and horizontally 

through Pj, with and on L^ and and on Lg. Then the 
triangles Q^H^Pg and QfiJ^g are similar, so that Q^HfH^Pg = 
Q zP3 (using the proper signs, upwards distances being measured 

positively, and also those from left to right). This relation can be 
written 

In the same way the triangles HiP^Pg and H^P^Pg are similar, so that 

H,PJX= -H^PJf.. 

Adding these two equations we get 

Q,PJX= -Q^Pzi^ 

But 0 iPi = OjPi — = Fj — OgPg = Yi — Fg, and similarly 

QP2 = Yz — Tg, so that 

(F, - Fg)/A== -(Fg_Fg)/> 

Solving this equation for F,, we obtain 


^ (AI 2 + m5’^i)/(A + (J-) 

whicli is (7.1). 'This equation can also be written in the form 


F. 


S - - K-F, 


(I -k)F2 


. (7.2) 

where --- /^/{A /.). for i — k- A,(A - 

I he sinqilest example is when A /i, k h. ^'he Lg scale is then 
situated midway between the L, and L., scales, a'nd F, = A (F, + Fg). 
If we graduate all three scales uniformly and with the same unit, then 
by joining the point Pj niarked 5 , on L^ to the point Pg marked F2 on 
L. we obtain a line cutting the Lg scale at the point p” marked F, = 
^ 1 b ^ 2)* ^o that this nomogram enables us to find the av'erage of 

two numbers I j and i 2- 1 lowever, it is more useful to graduate the Lg 

scale with unit of graduation half that for the L, and L. scales, i.e. so 
that the mark Vg occurs at distance O^P, - - Ay, along the scale. 

\Ve then have 



-- 2(^1 4 - i 2) 





or 
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and the nomogram, which is the one shown in Fig. 7,9, enables us to 
add the two numbers and Vo. 

More generally we can say that if a unit of graduation on the scale 
Li has actual length U ,, then the marky'j on the scale will occur at actual 
distance O^Pj = K, from O,. Similarly if f/^ is the actual 

length of the unit of graduation on the L 2 scale, and t/, on the L, scale, 
then Y 2 = y^U 2 and Yg = ysUg so that (7.2) becomes 

Ugyg ^ Kt/O'i + (1 — «) C/aja 

The readings of the points P,, and P, on the scales so graduated 
will be ^,,^2 andy* and will be connected by the relation 


y, = {xU,IUg)y, + ([I - UJUg)y, 

= -f ^2>’2 .... (7.3) 

where — kUJU g and B^ — [i — /f] U2!Ug. By choosing the quan¬ 
tities K, Ui, U2 and U, suitably we can construct a nomogram to calcu¬ 
late any desired combination of the form; P,ji +B2y2- 

Fortunately it is possible to avoid even this amount of calculation. 
This is because we can set up and graduate the lines Pj and Lo exactly 
as we wish: the line Lg on which the answer appears will then be fixed 
both in position and graduation, but is easily determined graphically. 
(This amounts to saying that we can choose any value we like for f/j and 
^li but that, given the values of B^ and we can then find those of 
K and Ug.) 

As an example we shall consider the following problem. The calor¬ 
ific values of protein, carbohydrate, and fat are approximately 4, 4, and 
9 calories per gram respectively. We wish to construct a nomogram 
wluch will enable us to find the calorific value of a diet of given compo¬ 
sition. If we let y^ denote the total number of grams of protein and 
carbohydrate together, andyg ^he number of grams of fat, then the total 
calories will be y, = ^yi + 9^2- 

Now draw two parallel lines L,, at a convenient distance apart 
to serve as scales for jy, andyg* graduate them on any suitable scale. 
We may graduate from o to 500 grams, atidya from o to 300 grams, 
since the total consumption of fat will in general be less than that of 
protein and carbohydrate combined (Fig. 7.12 overleaf). We find the 
position of they, scale as follows. Since a diet consisting of 450 grams 
protein and carbohydrate and no fat has a value of 1800 calories, the 
line joining the mark 450 on the first scale to o on the second must pass 
through 1800 on they, scale. So also must the line joining o on they, 
scale to 200 on they^ scale. 'I’he [loint where these two lines intersect 
triust therefore be the point marked 1800 on P,. So P, is the line 
through this point parallel to P, and P^- 'I’he zero point on P, will be 
Its intersection with the line 0 , 0 .^. Having found P, and the marksy, 

for two points on it we can complete the graduation l)y simple propor¬ 
tion. 
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This method can also be used, with only slight modification, to con¬ 
struct nomograms for expressions of the form = A -f- 

where A, Bjy B2 are constants. As an example, suppose we want 
a nomogram for yg = 100 + ^y^ -J- gy^- We have already shown how 
to construct one for yg = ^y^ 4- ^y^'y the new nomogram will differ 


Protein + 
Carbohydrate 

(y, grams) 


Heat i/alue Fat 

(Vs calories) grams) 



t’w i" by 100. In other words all 

TOO thr. ! f for r, .. 4y. + gy, and add 

the no “° ntarked on the line Z,j. If we were constructing 

To, th' ° '’“' "’R previously obtained one for y. = 

t T ni" 'n°V'‘* as before, drawing 

two su.table parallel scales L, and i, for v. and y, respectively 

We now note that the values y, ^ 450. y, .= o and v, = o. v. = 200 

• A < V •• 
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both give = 1900, so that the lines joining the points marking = 
450 and o to the points jo = o and 200 respectively must meet on Z,, 
= 1^0- •f's is the line through this point parallel to and L2. 
ru f ^1^2 joining^yj = o to y'o = o "ill now meet Lg at yg ~ 100 
the formula y^ = 100 -f 4jyi + 9^2) instead of at the point y^ = o. 
Having found two points on Lg we can graduate the rest of the line by 

simple proportion. 

Of course it is rarely necessary to construct a nomogram for such a 
simple formula as A But if we use logarithmically 

graduated scales instead of uniform ones this becomes 

logjVs = ^ 4-logjVi + ^2 I«g^2 


yg = Ky^B^y2^‘ 

where K. = antilog A. For example we can construct a nomogram for 
the Dubois formula S = -2025 W'^-^ where S is tlie surface area 

of the body in square metres, the weight in kilograms, and//the height 
m metres. Expressed logarithmically this becomes 

log S = —6936 + 425 log W + 725 log H. 

(since log 2025 = 1-3064 = — *6936). 

Naturally there must be bounds to the values of S, If'and H under 
consideration. Suppose we allow the height II to range from -5 to r8 
Metres, and weights W from 2 to 70 kilograms. 'I'hen the logarithm of 
H will vary from (roughly) —3 to +-25, a total range of *55, and the 
logarithm of W from -3 to 1-85, a total range of 1-55. 

We must now choose a convenient length for our scales—naturally 
the longer the scales the greater the accuracy of the nomogram. If we 
about 15 cm to be a convenient length, then since the logarithm of 
H varies by a total amount of -55, this means that we can graduate the 
/* scale with 3 cm representing a change of i in the logarithm (thus 
giving an actual total length of 3 x 5 5 = 16 5 cm). 'I'he lowest point 
m the scale corresponds to a logarithm of about —-3, and therefore the 
?cro point Oj occurs about 3X3 = 9 cm up the scale. 'J'he first stage 
in the construction of the nomogram therefore consists in drawing a 
vertical line L,, about 17 cm long, and marking the zero j^oiiu O, about 
9 cm from the bottom. 'I'his point O, will actually be marked i, since 
1 = 0. We can now readily mark off the other pr)ints, since the 
of the point marked H from the zero point O, is 30 log H cm. 
(this follows from simple proportion: a difference of -i in the log = 

3 cm, i.e. a difference of i in the log • 30 cm, and a difference log II r - 
30 log H 

cm on the scale.) 'I’hus the mark for 11 metres is placed 
30 log i-i = X -0414 ^ 1-24 cm above O,. 'I’his is shown (on a 

reduced scale) in Fig. 7.13. 

fn the same way on the L.g scale for weight JK since this is to cover 
a range 1-55 in logarithms and about 15 cm in actual distance, it will be 
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convenient to take i cm = a difference of *1 in the logarithm, or lo cm 
= a difference of i in the log. Unfortunately here the logarithm ranges 
from *3 to 1-85, so that the zero point does not lie on the part of the scale 
we are considering. We can overcome this difficulty by using another 
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Fig. 7 .X 3 —Nomogram for surface area and loss of heat per day 


reference point on the scale, say R^. where log XV = i, and accordingly 
l-F = 10 . Since the bottom of the scale corresponds approximately to 
log IV --- * 3 , this reference point will come about 10 (i — '3) = 7 
up the scale. We therefore draw a second vertical line La at (say) 8 cm 
distance from Lj, and mark the reference point 

remainder of the IV scale is then readily graduated, the point fF. e.g. 
20 kg, occurring at a distance 10 (log IV — i) = 3’Oi cm above Rz- 
Having drawn the // (Lj) and IF (Lo) scales, we have to find the 
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S (L3) scale, both as regards position and graduation, so as to give the 
equation 

log 5 = —694 + -425 log W + -725 log H 
when used as a nomogram. 

This can be done in two ways. Graphically we do it by noting that 
if log W = 1, log H — -425, then log 5 = —694 + -425 + *725 x 
•425 = *039, and if log W = 1*725, log //= o, then log S = —-694 -f 
1*725 X *425 = *039 again. Thus the line joining the points log H = 
•425 on the H scale (30 x 425 = 12*75 above Oj) to log W= 1 
(i.e. R 2) on the W scale should meet the line joining the point log H = o 
(i.e. O,) to log W = 1*725 (i.e. 10 x 725 = 7 25 cm above at an 

intersection point log S = *039 on the S scale, L^. We can then draw 
Lf vertically through this point. The line joining Oi (log H = o) xo 
R2 (log W = i) will meet the »S scale where log S = —-694 + -425 = 
—269. Having found two points in this way on the S scale Lg we can 
graduate it by taking simple proportion in the logarithms. 

Another and perhaps simpler way is to use the fact that the equation 

log S = A + Bj log H + B2 log W 

holds, where 

B, = ^2 = [I - K-] UJU, . . . (7.4) 

and C/, is the length of the scale corresponding to a difference of 
I in log H, U2 the length of the scale corresponding to a difference 
of I in log W, Ug the length along L, corresponding to a difference of 
I in log S, and k is the ratio of the distance from L, to to the dis¬ 
tance from to L^- This equation is simply an adaptation of (7.3) to 
logarithmically graduated scales. 

Now we arc aiming at the values A = —-694, /i, = *725, = *425; 

and we have chosen Vi = 30 cm per unit log, t/2 = 10. ('rhese num¬ 
bers t/j and U2 are sometimes called the moduli of the scales; this is 
still another use of the word “modulus”.) Thus we have to solve the 

equations 

/i, = nUJUg, i.e. *725 = 30K/t/, 
and 

B2 = [i - K-lVjVg i.e. -425 = lofi - 

for the unknowns k and Ug. This is done thus. 

"^rhe first equation can be written BJU^ = KjU„, and the second 
B2IU2 = (i — K)jUg. On addition we get ijVg BJUi 4-Z?2/f^2, 
or 

Ug -- [BJU, -^-BJU^]-' 

= [-0242 -I- -0425]“* 

= [0667]“’ = 15*0 

Now 

^ '0242 

== - 3^*3 
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and the distance from L^to 

= K X (distance from to 
= 8 k = 2*90 cm. 

We can therefore draw Lg 2 90 cm from Zrg, and we know that on this 
scale a difference of i in log S corresponds to a distance of Ug = 15 cm. 
It remains to find a reference point on this scale from which we can plot 
the others. A convenient point is the zero point Og, log 8 = 0 ^ 3 = 1. 
By our equation this will occur when 

o = —694 + 425 log W H- -725 log H. 

We can join the points where log H — o (i.e. the point Oj) and where 
log W= •694/ 425 = 1633 (i.e. a point 10 X *633 = 6-33 cm above 
i?i). This line will intersect Lg at O,, and from this the remainder of 
the scale can easily be graduated. We have now completed the nomo¬ 
gram; a line joining points H and W will intersect Lg at S where log 
8 = —694 + 725 log H + 425 log W, i.e. 3 ~ *2025 H 

To avoid conversion the H and IV scales can also be graduated in 
British units, 3 being still read off directly. 


7.7 Nomograms with more than three scales 

We can readily extend this scheme to construct nomograms for 
relations such as 

yt = yi +>'s +>'3 

or yt = A + B^y^ 

or log ^ -h -Si log^'i + Sg log Va + S3 log J3 


For we can decompose the equation — A Sjjyj B^yz + Say3 
into two equations by bringing in an intermediate number yg in the 
following way: 

yg = A + B^y^ + B^y^ ) 

y»=y.+ S3^'3 j 


We can construct a nomogram which gives ys from 3’1 and 3*2, and using 
the same scale for y, we can construct a second one giving y^ fromy, 
and 3’3. Thus the procedure in using the nomogram will be to join the 
points y 1 and Vg on their respective scales by a line meeting the y* scale 
at 3%; this point is then joined to theys scale, and will intersect they* 
scale at the required point v, ^ A By^ + S23’2 + B^^. In this 
process the line L, bearing they, scale is merely brought in as a help 
in performing the construction, and does not need to be graduated at 
all. It may, however, be useful to mark two or three points on Lg while 
drawing the nomogram: these marks can afterwards be erased. 

'I’hus in Fig. 7.13 (p. 158) we have combined the nomogram for sur¬ 
face area 3 —- *2025 H'~^^ ^vith the product nomogram T = 2 ^ 3 C, 

where C — heat lost in calories per square metre per hour, and T = 
total heat lost per day. '^I'his is a very simple nomogram to construct: 
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since S is already logarithmically graduated, all that is necessary is to 
draw the scale for C on a line parallel to that for S, and graduate it 
logaritlmically. The position and graduation of the T scale is then 
determined from the equation 


T = 24 SC, or 

log T = log 24 + log S -f- log C. 

In Fig. 7.13 the C scale has been graduated from 10 to 100. The lines 
joining the points *5 = 2 to C = 25 and S = -5 to C = 100 must both 
pass through the point T = 1200, which is thus determined. The 
vertical line through this point will be the T scale, and it will intersect 
the line joining S = -z to C = 25 at T= 120. Now the distance 
between the points T = 1200 and T = 120 will correspond to a differ¬ 
ence of log 1200 — log 120 = I in log T, and this distance will therefore 
be the unit Uj of graduation for the T scale. .-Vs we have now found 
two points and the unit, the remaining marks are readily filled in by 
simple proportion on log T; the point T will be placed at a distance 
(log T — log 120) U-r centimetres above the point 120. 

. As an example of the use of this nomogram we may find the surface 
area S and the total heat T lost per day for a person of height H = 
1*50 metres and weight IV— 60 kilograms, losing heat at the rate of 
^ = 40 calories per square metre per hour. 'The line joining the points 
fi and W intersects the S scale at S 1-55 square metres: and the 
line joining this to 40 on the C scale intersects the T scale at 1500 
calories per day. If we were only interested in the values of T we could 
clearly ignore the graduation on the S scale. 


I> K O IJ 1. I- M .s 

Construct nomograms for the following: 

(l) The respiratory qij(;tient, i.e. the volume of COj expired 
divided by the volume of inspired. [ I'lie scale for the volume of 
C028hould be placed midway between the scale for respiratory quotient 
and the scale for volume of O.^. 'Hie two external scales should be given 
the Same logarithmic unit, the values for the volume of CO2 running 
from 100 to 1000 cc. 'Fhe intermediate scale for Oj can be then grad¬ 
uated automatically for volumes of O^ ‘^^•1 

{2) I he colour index of blood, viz. the haemogloI)in (as per cent of 
the normal value) divided by the number of erythrocytes (as per cent 

of normal value). 

. (3) The gas equation, V TRiP. (If T is the absolute tentperatiire 

in degrees centigrade, P the pressure in newtons jht square metre, i.e. 
times the pressure in millibars, and V the volume in litres, then 
~ 8300 approximately.) 

M 
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(4) Pythagoras’s theorem, r* = (Use scales graduated by 

squares.) 

(5) The lens equation, i/v — iju = ijf. 

(6) jVi +>*2 +>'3 +3^4 — 

(Place the scales at equal distances apart in the following order. First 
a scale for then one for graduated with half the unit of ^1, and 
downwards instead of upwards, then one for yg = y^ + 2» then one for 

y^ graduated upwards with the half unit, one for yt — ys + J3» or'® 
for ^4 graduated downwards with the half unit, and finally one for z.) 

(7) It was stated above that in the nomogram for y, = Bjyi -p 
it is possible to choose any parallel scales for3^1 andy2» the position 

and unit of graduation of the ya scale being then determined. This is 
not quite true: occasionally there may be difficulties. What are these 
difficulties, and when do they arise ? 



CHAPTER EIGHT 


RATES OF CHANGE 


8.1 Movement and change 

Everything is (as far as we can tell) always moving and changing. 
Sometimes the change is slow and minute, as for a mountain which 
seems the same from year to year, but is nevertheless being worn away 
3 nd dissolved by wind and rain. Sometimes it is exceedingly rapid, as 
with the lightning flash, which strikes for an instant and is gone. Living 
things especially are full of unceasing chemical and physical activity. 
Even when at rest a plant or animal must constantly breathe; and while 
some spores and seeds can remain dormant for long periods and still 
be able to germinate, for most living creatures change is life and the 
complete cessation of all activity only comes with death. 

Changes which occur naturally are of two kinds, continuous and 
discontinuous. A discontinuous change from A to i is one in which 
there are no intermediate states: for example, melting ice undergoes a 
discontinuous change in state, as there is nothing intermediate between 
the solid ice and the liquid water. It is true that if we take a block of ice 
and slowly warm it the change is continuous in the sense that the ice 
melts gradually, so that the quantity of ice and the quantity of water 
both change continuously. But all the time there is a clear boundary 
between the two states, and it is impossible to confuse one with the other. 
If We Warm the water after all the ice has melted it will then undergo 
continuous changes in temperature and volume, contracting at first 
tintil the temperature has reached 4® C, and expanding after that point: 
3 nd it will go on changing continuously until it reaches the boiling 
point, when a further discontinuity occurs. A philosophically minded 
reader may perhaps challenge this distinction, and claim that if \%c 
could actually see individual atoms at work we should find that all 
changes were continuous; or alternati\ ely he c(»uld argue that according 
to quantum theory all changes were al^rupt. 1 his point is certainly 
debatable, but whatever the ultimate truth may be we have in ordinary 

practice to reckon with both kinds of chartge. 

Now in itself a sudden change can quite easily be specified mathe¬ 
matically: all w'c have to do is to state the values of the quantities we are 
considering before and after the change. I hus we can say that ice with 
a density 0-9 gm/cc changes into water witli a density i •©. A continuous 
change is a little more subtle: it can take place slow'Iy or rapidly, and 
it can speed up or slow down. The mathematical theory dealing with 
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the measurement of rates of change is known as the “differential 
calculus’*. Fortunately—in spite of the terror which some non¬ 
mathematicians feel at the word “calculus”—it is one of the simplest 
of all branches of mathematics as far as formal manipulation is con¬ 
cerned. Once the definition of “rate of change”— or “derivative” as it 
is technically named—has been mastered, there is very little more to do 
except to learn and practise certain simple rules of procedure. 

The calculus has the added advantage that it follows very closely 
our everyday use of language. The simplest kind of continuous change 
is change of position, or movement: and we speak of the rate of move¬ 
ment, i.e. speed, as “quick” or “slow”. These words apply in the 
strictest sense only to motion: but we are accustomed to carry them 
over to all kinds of change, as when we say of something that it changes 
colour rapidly, or that the population of Britain is increasing more 
slowly than that of India. Indeed this use is so natural that it can hardly 
be considered as a metaphor. The words “quick” and “slow” are also 
used in cases where time is not involved at all. For example, we say 
that the main road out of Brighton rises only slowly for the first mile, 
then rather more quickly up to the northern edge of the Downs, after 
which it falls very rapidly down to the Weald. This use of the words 
“slowly” and “rapidly” might be justified by saying that we imagine 
ourselves travelling along the road: but that is not an essential part of 
the situation. What we are talking about is the slope of the ground, 
which could be expressed alternatively by stating its inclination to 
the horizontal plane. All these different kinds of rate of change, 
whether we mean motion, or change of qualities such as colour, or of 
quantities such as mass, or the more indirect use of the word as a 
gradient—all these are equally well covered by the differential calculus, 
and all follow the same rules of calculation. 

8.2 Average velocity 

If a train travels 30 miles in half an hour, or 60 miles in an hour, or 
120 miles in 2 hours, then its speed is 60 miles per hour. More precisely, 
since it will probably not always be travelling equally quickly, we call 
the quotient of the distance gone divided by the time taken the “average 
velocitv over the whole journey: e.g. 20 miles in 20 minutes gives an 
averac:c velocity of 20 miles/J hour = 60 miles per hour. 

Norr: Observe that this is the definition of the phrase “average 
\elociiy . It has a connection with the ordinary use of the word “aver- 
.igc . If a train travels 40 miles between 2 p.m. and 3 p.m., 70 miles 
botwi i n 3 p.m. and 4 p.m., and 25 miles between 4 p.m. and 5 p.m., 
then the total distance covered is 40 4- "/O -(- 25 = 135 miles in 3 hours, 
and the average speed is 135/3 = 45 niiles per hour. This is the average 
of 40, 70, and 25 in the usual sense. Here we have measured the three 
velocities of 40, 70 and 25 miles per hour over equal intervals of time 
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of one hour each. If vve measure them for unequal intervals of time, the 
average velocity over the whole journey need no longer be the average 
of the velocities for each part of the journey. For example, if a walker 
does the first lo miles at an average velocity of 4 miles per hour, and the 
next 10 miles at an average velocity of 2 miles per hour, his average for 
the whole 20 miles is not 3 miles per hour. 

Query: What is it? Why? 

We can readily express the average velocity by a formula. Suppose 
we represent the time (measured in hours) by t and the distance \\ hich 
(say) a train has gone (measured in miles) by j’. y and t will accordingly 
be variable quantities, andy will be a certain function of t, that is to say, 
when we specify a particular time t we can find the corresponding dis¬ 
tance y. We can plot a graph of y against t, which will show us how the 
train is running (the curved line in Fig. 8.1). 



Now lety, be the distance gone at a particular time represented 
by a point P, on the graph, and y 2 the distance gt)ne at time / a. as repre¬ 
sented by the point Pz- ^he total distance travelled between P, 

and P.2 is (y 2 — >'i) miles, the time taken is / ,) hours, and the aver¬ 

age velocity in miles per hour is accordingly 

(>'2 - J.)/('2 -'1) ■ • ■ 

Now if we draw P^Z horizontally as in Fig. 8.1 to meet the vertical 
through P2 at Z, then PiZ (2 h > ZP-. - y -. >'1, and 

V - ZPJP^Z tan / ZP.P. tan . . (8.2) 

where 0 is the inclination of the straight line P\P-i to the horizontal. 
'I’hat is to say r is what w'e have called the “slt)pe” or “gradient” of the 
line P,P2, defined as the tangent of the angle of inclination ijj. 

No'IF. We are supposing that the scale of graduation is the same 
along lioth axes, i.e. that the same length is used to represent a unit of 
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time t horizontally and a unit of distance y vertically. If not, the relation 
V = tan tp no longer holds without a correction given later in equation 

( 8 - 5 ). 

We thus have a simple geometrical interpretation of the average 
velocity v between two times, and is the gradient of the straight 

line or chord joining the corresponding points Pj and on the graph 
of distance against time. 

The quantity fg — the time taken between Pi and Pg: it may 
also be considered as the change in the value of t when we go from Pi 
to Pg. Similarly —j>'i is the change in distance. It is convenient to 
use the Greek letter 8 to mean “a change”, so that we denote the change 
in ty measured by (/g — ^i), by the symbol St, and the change in the 
value of y, (y^ — >*1), by Sy. Here St is to be understood to be a single 
symbol meaning neither more nor less than (fg — ^i): the letter 8 does 
not stand for a number, but for the whole phrase “the change (or 
difference) in the value of”. (The Greek letter 8 corresponds to the 
initial letter d of the word “difference”.) 

To illustrate this notation let us take an example. A train reaches 
Cambridge (56 miles) in i J hours after leaving London, and it reaches 
Hunstanton (112 miles) in 3^ hours altogether. What is the average 
velocity from Cambridge to Hunstanton? 

Here jy, means the distance from London to Cambridge, measured 
in miles, and is therefore 56, and t^ represents the time taken, in hours, 
and is i J. 

y^ means the distance from London to Hunstanton, or 112, and /g 
the time taken, 3 

8y== _V2 — Vi therefore means the distance from Cambridge to 
Hunstanton, and is 112 — 56 = 56. 

Sf = rg — = 3J — ii = 2 

is the time taken on this part of the journey. The average velocity is 
accordingly 

= (vg —.yi)/(^2 — ^i) = 56/2 == 28 miles per hour. 

It follows that in general we can write 

average velocity v = (vo —>’i)/(/2 — ^i) = SyfSt (8.3) 

Again, in this formula it must be remembered that 8^ and St are con¬ 
venient abbreviations for {y^ — Vi) and (/« — ti) respectively and are 
to be treated as single symbols. We cannot cancel out the S’s in Sy/St 
to make it y ‘t, as wc could have done if 8 had been an ordinary number. 

If wc are dealing with two quantities y and t which are connected 
by a purely empirical relation then we can only calculate the average 
^•elocity arithmetically, as we have done above for train times. If, how¬ 
ever, there is an algebraic relationship connectingy and t, such as3’ = 3^, 
or^’ - - / -b t-, then we can do better and find a formula for the velocity v. 
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EXAMPLES 

(1) The lawjy = 3/. 

To say that the distance^ is connected with the time t by the relation 
y = 3/ means that at each particular time (such as ^j) the corresponding 
distance (yi) will be 3 times the time, measured in appropriate units, 
i.e. y, = 3^1. In the same way the distance yg reached after time 
will be 3^2> and accordinglyya —>'1 = 3^2 “ 3 ^i = 3 (^2 “ ^i)- There¬ 
fore the velocity v = (yg —yi)/(^2 ~ ^1) = 3 - Graphically the relation 
y = 3< is represented by a straight line, of slope 3, and therefore any 
chord PiPi coincides with this line and must have slope 3. This relation 
y = 3^ is the law of motion of a point moving with a constant velocity 
of 3 units. 

(2) The law y = / + 

If the relation between the position y (measured in metres distance 
from a fixed point O) and time t (in seconds) is y = / + P, as might be 
true of a sphere rolling down a slope, we can still argue in a similar way. 
We must havey, = y^ = /g + ^2*. and therefore on subtrac¬ 

tion 

Sy = y 2 — yi = (N + (2^) — 

= (?2 ~ fi) + (/g^ — ^i") 

= (t 2 -ti) -+-(G-/,)(G 

= (^2 — /,)(l + ^2 + ^l) 

(taking out the common factor fg — /,). Thus 

^ = (^2 — >’l)/(^2 ^ /i) = I ^ ^2 + ^1- 

This formula gives us a quick method of calculating an average velocity: 
for example the average velocity of the sphere between times ti = 2 

and <2 = 4 would be i + ^g *f = 7- 

Before quoting any further examples there arc two points to notice. 

The first one is that in the formula d — (yg —yi)/(G — ^i)* dis¬ 

tances yj andyg may be measured from any fixed point or origin O on 
the line of motion of the body: they are not necessarily the distances 
measured from the point from which the body begins to move. For we 
are interested only in the difference Sy = y 2 —- yj, or change in position, 
and this will be the same whatever point we choose to measure y from, 
provided that we pay proper attention to signs, measuring distances 
positively in one direction and negatively the other way. Similarly we 
can measure the times and /g from any convenient instant, such as 
noon. 

Besides this there is a paradox which deserves mention. According 
to our definition, if a train goes from London to Hunstanton, and then 
returns to London, the total distance over the whole journey measured 
byy, — y 2 is zero, since the initial and final positions are the same. The 
average velocity is accordingly zero! 'I’he reason is that our definition 
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of “average velocity” takes into account the direction as well as the rate 
of motion. When the train runs from London to Hunstanton it does so 
with positive velocity: when it returns it has negative velocity, since 
the distance j from London is continually diminishing. On balance the 
train has no velocity at all. Now the reader may at first sight think this 
somewhat unfair, and the engine-driver certainly will. It is customary 
to meet this objection by defining “average speed” (as distinct from 
average velocity) as the quotient of the distance travelled, regardless of 
direction, by the time taken. The “ average speed ” of the train will not 
be zero, although its average velocity will. But although this definition 
may save the engine-driver’s honour, it is not on the whole nearly so 
useful as the one which takes direction into account. If we are standing 
on the track, and see a train moving at 6o miles per hour, it is a matter 
of considerable importance to know whether it is approaching or reced- 
ing. Similarly if we are considering the population of a country, and 
are told that it is changing by 300,000 inhabitants a year, this information 
may be quite interesting. But it will be much more valuable if we know 
whether the change is an increase, i.e. +300,000, or a decrease, 
—300,000. So except where the contrary is expressly stated, all velo¬ 
cities, and other rates of change, will be considered to have sign as well 
as magnitude. A positive sign indicates an increase or movement in the 
positive direction, and a negative sign a decrease or movement in the 
negative direction. 

(3) The law j — ijt. 

Here = i/^j, = 1/^2, and 

V. -3-1 = 1//2 — i//j = (/, 

The average velocity between times and is therefore 

^2 == 2, then r — — 1 . The negathx sign indicates a move¬ 
ment in the negative direction. If for example the distance y was 
measured north ot a fixed point O, then the law of motion y = ift 
indicates a southward movement, towards the point O (for positive/). 

So far the quantity y has been a distance, and the quantity v = 
SyjSt — (v« Vi)/(^2 — ^i) a velocity. But a similar definition vail 

apply to any changing quantitv. If is its value at time /j, and Vg its 
value at time t., then we call = (3.3 — 3q)/(/, — fj) = Sy/Bt the 
“average rate of change of v between times ti and #3”. 

(4) animal is growing in such a wav that its weight v grams is 
connected with its age i weeks I>y the relation v = 2 + What is the 
rate of growth between times /, and t^} 

If 3’, is its weight at time /j, and3'2 at time ig. then 3^, = 2 + 6/1^, 
y., = 2 + 6/0^ and 

y., - y, 6 tJ - 6 /,3 ... 6(/./ - t^^). 
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But = (tz — ?i) (tz^ + tzti + ^1^) by equation (3.5), so that 

y 2 -yi = Hi 2 ~t^){tz^ +t^tz +/r), 

and 

® = {y 2 —yi)Kt 2 — fi) = t,^), 

measured in grams per week. For example, between = i and tz= 1-5 
weeks it grows at an average rate of 6 {fi^ + ^2 + ^2^) = 6(1' + 1-5 

+ 2*25) = 28-5 grams per week. This can be readily verified directly. 
After I week its weight is 8 g, after 1-5 weeks it is 22 25 g. The increase 
in weight hy = 22*25 — ® = ^ 4'^5 8* the time taken Bt ~ 1-5 — i = 
*5 weeks, and the rate of increase is By/St — = 28-5 grams per 

week. 

PROBLEMS 

Find the formula for the average velocity or rate of change of a 
quantity^ when it is related to the time t by the following laws: 

(1) y = 2 4- 3-5^ 

(2) y = I — / + i// 

(3) y = 3/^" 

(4) y = 2/(1 + 1) 

(s) y = U+ 4- 1) 

(6) jy = I +t -\-t^ 

(7) y= 1/(1 — t^) 

Verify the following formulas for rate of change for the laws stated 

(8) Xf y ~ I, V = By I Bt = o 

(9) If y = V := I 

(10) If y = V = + tz 

(11) If^; = V = iz^ -\-tit2 + 

(12) If y = t*, V ^ tz^ t2^ + tztr -f- 

(13) Generalize to the lawjy = where u is a positive integer. 

(14) If>'= \/t,v= iliVt-i -b Vti) 

(15) If^- = t~\ V = —ijt, tz 

(16) If y = V = — {tz 4 - ^)/^- 

Instantaneous velocity 

It is very natural to feel that it is sensible to say not only that “this 
automobile has been travelling at an average speed of 25 miles per hour 
for the last 5 minutes” but also that “it is moving at this moment at a 
speed of 23 miles per hour”. In other words we feel that there is such a 
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thing as a definite speed at a definite instant, as well as an average speed 
over an interval of time. 

How are we to express this mathematically? We cannot take the 
definition of average velocity (^3 —— ^i) and put for 

that would imply = v^y and leaves us with a fraction 0/0 which has 
no meaning. We can, however, imagine the velocity calculated over an 
interval of time ht which is very short, but not absolutely zero. 

To see how this works out let us consider the sphere which we imag¬ 
ined rolling down a slope according to the law y = t We have 

already shown that the average velocity of the sphere benveen time <1 
and time t ^ is i 1 2 tiy 2i formula which will save us some calcu¬ 
lation. 

Now the average velocity beUveen time fj = i and a time ^3 shortly 
afterwards is given by the formula v = z 1 2- Thus the average 
velocity between times i and i*oi seconds is 3*01 (metres per second); 
between = 1 and ^3 = l ooi seconds it is 3-001; between times <i=i 
and ^2 = l ooooooi seconds the average velocity is 3-0000001. The 
shorter the interval of time over which the velocity is calculated the 
nearer it approaches to 3. It is therefore natural to say that the actual or 
instantaneous velocity at time =s i is 3 metres/sec. Similarly, if we 
take = 2, ^2 = 2-001 the velocity is 5 001; = 2, = 2 0000001 

gives a velocity 5 0000001, and keeping /j — 2 the nearer tz approaches 
2 the nearer does the average velocity approach 5. Thus the instan¬ 
taneous velocity at time ti~z seconds is 5 metres per second. In 
general we shall say that if as the interval of time S/ = ^3 — over 
which it is calculated becomes shorter and shorter the average velocity 
V = ByjSt approaches more and more nearly some “limiting” value Vy 
then V is the “instantaneous velocity” at time ti. In symbols 

F as Bt —> o, i.e. as ^2 ^ 

We can now find a general formula for V in the case of the moving 
sphere. Such a general formula i’ = 1 +^2 + ^1 has already been 
obtained for the average velocity between times ti and ^3. Now it is 
clear that the nearer approaches ti the nearer does v approach 
I 4- -f- = I + ztj. This is therefore the required instantaneous 

velocity at time Since there are not now two distinct times ti and ^3 
to consider, but only a single time we can drop the suffix j as super¬ 
fluous and say that the (instantaneous) velocity at time f is F = i -h zt. 
Thus when i = i, F = 3; when t — z, F = 5, as already found; when 
^ = 3 ’ — 7 > so on. Again if y is some measured quantity other 

than a distance, we have defined v = ByjBt — (y« —yi)/(^2 — to be 
the average rate of change of the quantity y between the times and 13. 
Again we cannot put = tz in this fraction, or we should obtain 0/0. 
hut if as the interval fo — becomes very short, so that tends to ^i, 
the average v approaches a certain limiting value F, we can speak of F as 
(by definition) the “instantaneous rate of change” ofy at time Thus, 
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considering the animal of Example (4) above whose weight, y grams, is 
given by the formula = 2 + 6/®, where t is its age in weeks, we have 
already shown that v = + ^i")* Ehe nearer approaches 

to the nearer v will approach to 6(/i- + + /,2) = Drop¬ 

ping the suffix 1 as no longer necessary-, we see that the instantaneous 
rate of growth at t weeks is V — i 8 t^ grams per week. 

As a numerical illustration of this let us put = i. Then if / j = 
I’Oi, the average rate of increase in weight between times and /g will 
be z; = 6(^2^ 4 - tzti + = 181806 grams per week. If we shorten 

the interval by taking /g = l ooi, we get v = i8-oi8oo6; if we put 
<2 = I’oooooi, to get a very short interval indeed of -oooooi weeks, then 
V = 18*000018000006, a value which differs quite inappreciably from 
the instantaneous rate F = 18 at f = i. 

Unfortunately there is a serious defect in the idea of “instantaneous 
velocity”; it is one which it is utterly impossible to check empirically. 
In discussing the case of the rolling sphere, we calculated from the for¬ 
mula ^ 4- that between = i and ~ I'Oi seconds the average 
velocity was 3*01 metres/second; this means that the distance travelled 
was *01 X 3*01 = *0301 metres, and that could be verified by careful 
measurement. But w'hen we shorten the interval of time to ooooooi 
second and say that the average velocity is then 3 0000001 metres/sec, 
we are asserting that the distance travelled is 00000030000001 metres in 
this short time. Although such a statement must be true if the law of 
motion y = t is accurate, it is nevertheless quite impossible to 
check in practice. 

In the case of the hypothetical growing animal which is supposed to 
obey the law jy = 2 -f the situation is even more serious. This 
equation may fit the weights observed very well if we take only daily 
readings. But a more detailed consideration will show that it simply 
cannot be true when w'e consider shorter intervals of time. For it gives 
a steadily rising weight, whereas in fact the weight will increase very 
rapidly at feeding time, and fall in between meals. And if the equation 
y = 2 is untrue so also is the rate of change of weight F = 18/- 

derived from it. 

Fortunately these objections do not really matter very much in prac¬ 
tice. We may be unable to verify that a body is moving with instan¬ 
taneous velocity i + 2/, but for all practical purposes it behaves as if 
that was so. We may be actually wrong in supposing that the animal’s 
weight is always increasing at the rate 18/". But in the long run we shall 
not be very far wrong, and for many purposes it may be considerably 
easier to take a deliberately simplified model rather than to worry over 
all the irregularities in detail. Such an idealization is common in many 
branches of science. We can even treat the population t)f a country as 
if it was a continuously varying quantity. We kinnv of course that in 
fact it can only change by steps of i i for a birth or immigration, or —i 
for a death or emigration. But among a population of millions such 
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individual changes are negligible. It is possible to talk of a “contin¬ 
uously varying population” provided that the underlying simplification 
is not forgotten. 

To summarize: 

(A) If y is any changing quantity, i.e. in technical language, any 
function of the time i, then the average rate of change of y between 
times 11 and 12 is defined to be 

v = Sy/St = (3-2 —yi)/(t.2 — /i) 
whereis the value of^^ at time Ij, and3^2 value at time 

(B) If the relation between 3' and t can be expressed in mathematical 
terms, it will frequently happen that when the interval of time St = 

is made shorter and shorter the average velocity v will approach 
some definite value V. F is then called the “instantaneous” or “actual” 
velocity at time 

(C) This idea of instantaneous velocity is a purely theoretical device; 
strictly speaking it can never be completely verified empirically. 

(D) However there are many situations, such as the calculation of 
the velocity of a moving body, where the theory agrees with reality to 
as close an approximation as our measurements will allow. And there 
are also many situations which we can treat for all practical purposes as 
if there was an instantaneous rate of change, even though we know that 
this is not absolutely true when we consider them in detail. 

8.4 Algebraic functions 

Once a formula has been obtained for an average velocity v, it is 
usually quite easy to deduce the value of the instantaneous velocity V. 
For example, we have already shown that if the law of motion is 
y= i//, then the average velocity is “y = — If f 2 approaches in 

value, then approaches — or, on dropping the suffix j as usual, 

we sec that the instantaneous velocity at time / is F = — i/t-. 

.Again if the law of motion is 3’ = 1/(1 + /). we have 

Sv -- Vo - V, -= 1,(1 4 - ? 2 ) — 1/(1 + ^) 

= —(/a — 4- ti) (i 4- t..); 

average velocity 7’ = Sy/St — —1/(1 4- /j) (i 4- 
instantaneous velocity — I' — —1/(1 4-0"* 

'Phis (.lirecl method of finding instantaneous rates of change can be 
applied to a great variety of expressions—in fact to all algebraic fractions 
obtained by dividing one polynomial by another. The most important 
application is to laws of the form y — t". Here we use the results of 
problems (8) to (ifi) of Section 8.2. 

Example 1 v - i for all t 

Average velocity r — o 
Jnstantane<'>us velocity J ' - - o 
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Example 2 

y = t 

Average velocity v — 1 

Instantaneous velocity F = i 

Example 3 

y = 

Average velocity v — ~\- ti 

Instantaneous velocity F = 2/ 

Example 4 

y = 

Average velocity «> = + ^2^1 + fi" 

Instantaneous velocity F = 3/- 

Example 5 

y = 

Average velocity v = 

Instantaneous velocity F = 4Z® 


These examples suggest the general result: that for the law of motion 
y — the instantaneous velocity is 

F .... (8.4) 

For the case in which n is a positive integer this general formula can 
be proved as follows [using equation (3.7)]: 

= (/, _ 4 . -f 

Average velocity = hyjht = -f /o’"' 4 - • . • + 

On the right-hand side there arc n terms, and as approaches /, each 
of these terms approaches so that their sum tends to 7/r,"-h 

Therefore V = 

We have already shown that for the law 4’ ^ the instantaneous 

velocity is given by F = this agrees with (8.4) when n = ~i. 

Example 6 The law y = t-’^. 

Average velocity v -- —(^2 “> ^-i 

Instantaneous velocity F -- —2///* -- —2t~^ 

This agrees with (8.4) with n = —2. 'riiese examples suggest that 
(8.4) may be true for all values of n, not only positive integers. It will 
later be proved that that is so. 


PROBLEMS 

(1) Show that (8.4) is true in general if ti is a negative integer. (Put 
” ~ theny = i/P". Find v, and thence F.) 

(2) Find F for the following laws of motion; y~ 1/(1 4-F), 
y ~ Vt, y = lIVi- Show that the last two agree with formula (8.4). 
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8.5 Derivatives and differentiation 

Corresponding to any specified law of motion or change, relating a 
quantity to the time ty there will be an instantaneous rate of change of 
y, which we have called V. Technically V is called the “derivative’*, 
“derivate”, or “differential coefficient” of y with respect to the time t. 
The operation of finding the instantaneous rate of change is known as 
“differentiation”. 

This process can be illustrated graphically (Fig. 8.2). Let us draw 
the graph of y against the time ty and let Pj {^j, y^) and Pg (fg* jXs) t>e 



any two points on the graph. Then the slope of the chord PjPg is the 
average rate of change between times and /g. Now imagine Pg to 
move along the curve towards P^. As it does so the chord PiPg will 
rotate about Pj and will approach a certain limiting position which we 
call the “tangent” at P, (from Latin tatigens == touching. This use of 
the word “tangent” is, of course, different from its use as the name of a 
trigonometric ratio, tan 6 . However, there is a historical connection 
between these senses, but one which need not concern us here). As the 
chord tends to the tangent the slope v of the chord must tend to the 
slope ol the tangent, i.e. the derivative or instantaneous rate of change V 
— the slope of the tangent - tan where ^ is the angle between the 
tangent atid the /-axis. 

Now this is indeed the sort of relation we would expect. For where 
y is increasing, the graph will slope upwards (towards the right), ^ will 
be positive and so will tan ^ (Fig. S.3). 'I'he more rapid the increase, 
the greater the slope will be. If y is decreasing, the graph is falling; 
0 is negative, and so is tan 0. Where 0 — o, and the tangent is hori¬ 
zontal, y is momentarily stationary, neither increasing nor decreasing. 
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This relation can be used as a means of roughly estimating the rate V 
of change by drawing the tangent by eye, measuring 4 > and finding tan ^ 
from tables. 

So far we have used the letter V for the rate of change or derivative 
ofjy. This notation does not bring out the relationship between V and 



Fig. 8.3—The inclination ^ of the tangent compared with the 

rate of change 

y. It is usual to denote the derivative of^ with respect to t by the symbol 
i.e. 

V = D,y. 

Here the symbol Dt is shorthand for the phrase “the derivative with 
respect to t of”. It is a symbol which has no meaning standing by itself; 
it is the complete symbol "D,y" which means something, viz. the rate 
of change of y. Often the suffice t can be omitted as being understood, 
and we can write simply "Dy". 

This is not the only way of writing a rate of change. Another symbol, 
even more commonly used, is dyldt. The reason for such a notation is 
this: we have already shown that the average velocity v is hyjht, where 
means “the change in y” and ht “the change in To show the 
relationship between the average velocity v and the instantaneous 
velocity V the latter is written as dyldt: but although this looks like a 
quotient “dy divided by dt" it is to be interpreted as a single symbol; 
neither dy nor dt are to be taken as having any meaning of their own. 
(Some books give an interpretation of dyjdt wliich allows dy and dt to 
be regarded as actually existing quaiuities, but it is rather a complicated 
2nd artificial interpretation, and will not be considered here.) 

Other notations for V which are fairly often met with arej^' (read as 
dash” or ''y prime”), yt and y. ('rhe last is read as “y dot” and is 
used only when the variable t is the time.) Also ify is written as a func¬ 
tion of / in the form y = f{t), then V is written as V =f '(t). We men¬ 
tion these notations because the reader must be prepared to meet them 
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in mathematical books. It may be at first surprising to find six different 
ways of writing the same idea. But the biologist may be reminded that 
many species are naturally polymorphic—and calculus notation is poly¬ 
morphic for what may be similar reasons, that different symbols fit 
different circumstances. In this notation we can write the results so far 
obtained as 

Dt V' = d{t^)(dt = 

D, (2 + 6 ^ 3 ) = d{2 + 6 t^)ldt = 18^2 

A [1/(1 +/)]= -1/(1 

and so on. 

We can now generalize our ideas to include the more metaphorical 
cases where time is not involved. Suppose that the height {h metres) 
and weight {w kilograms) of a growing child have been found to obey 
the law = 13 h^. Suppose that during a certain interval of time h 
increases from /ij to Aj, and w from Wj to et>2 : then the increases in 
weight and height will be hw= and 8A = Ag — Aj respectively. 

We can call the quotient hwjhh the “average rate of increase of to 
with respect to A”. If A, = i-oo, = 1*20, then SA = *20; == 

13-00, — 22-46, and hw = 9*46, so that hwfSh = 47*3 kilograms 

per metre. Now if 8A becomes very small the average rate of change 
hwjhh will approximate to a number we call the “actual rate of 
change” or “derivative” of w with respect to A, and denoted by the 
symbols Di,zo or dzvidh. The method of calculation of the derivative 
is exactly the same as before. 8w = — zvi = 13A2® — 13^1* = 

13 (^^2 ~ "!■ ^M"). so that SzojBh = 13 (A2^-|-A2Ai-t~Ai2), 

and in the limit when h.y tends to Aj = A we have = 39A2. 

By drawing the graph of w against A, choosing equal units of gradu¬ 
ation for ft’ and A, the derivative D^w can be interpreted as the slope 
tan of the tangent. If different units are used for A and to, so that a 
unit difference in A corresponds to a length on the A-axis, and a unit 
difference in w has length t/^., then the formula is 

D„w = {U^ tan i/j)IU^ . . . ( 8 - 5 ) 


PROBLEM 

(i) Draw the graph ft’ = 13A2. By drawing tangents at the points at 
which A -8. i-o, and 1-2 respectively estimate the derivatives at these 
points. Compare with the theorcticallv calculated values. 

8.6 Limits 

We have already met several situations of the following typf: x 
and V arc two variables connected by a functional relation y = 

As X approaches a certain value, say A', v approaches more and more 
nearly to another value, say Y. I-'or example the nearer x is to zero, the 
nearer (sin x)jx is to the constant H (Section 6.11). The nearer St is to 
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zero, the nearer ByjSt is to K = Z),j. These facts can be expressed in 
two ways, either as 

“y tends to V as a:* tends to X'^ 

or in symbols 

^ y as A’ ^ X'\ 

or alternatively as 

“ Y is the limit of y as x tends to X'* 

or in symbols 

Y = lim y. 

x~>-X 

These alternative phrases are identical in meaning. Thus 

sin x/x H as a; -> o 
or lim (sin xjx) — H\ 

X O 

and hyl St V as St -> o 

or lim (Sy/St) = V. 

St-*o 

We also have a very similar limiting process in the case of an un¬ 
ending decimal. The equation J = '3333 • • • means that the decimals 
of the sequence *3, ‘33, '333, '3333 • • • approach more and more nearly 
to If we write Wj — -3, 1/2 = -33, W3 == ' 333 , etc., then we say that w,. 
“tends to i as r tends to infinity” or in symbols 

Ur -> i as r -> CO 

or lim Ur = A- 

T —► CO 

The phrase “tends to infinity” is simply a shorthand way of saying 
“becomes larger and larger without bound”, and has no philosophical 
implications about the nature of infinity. 

Now we have assumed in our calculations certain reasonable 
properties of limits: that if approaches in value, then 
approaches 2/,, and that /j/g approaches It is not difficult to justify 
some of these properties in a general sort ot way. For example, the 
difference between (/, f- and 2?, is (/, -f- t^) — 2/, so 

that it is immediately evident that the smaller the dilference t.> - - / j is 
the smaller is {t^ -f t^) — 2^,. Again the dilference between and 
is /j/g — so that if wc keep /, fixed the smaller 

(^2 — /,) is the smaller is {t^t^ — /,^). However, if we want to make 
these into completely formal proofs we have first got to translate our 
definition of a limit, “the nearer .v apj^roaches X, the nearer y is to F” 
into more exact terms; and this is a little more tricky than might at first 
be expected. So we shall content ourselves here with an explanation of 
what we can expect to lie true, without going into the details of the 
proofs (Section 8.15) which only confirm the cornmonsense view. 

If X tends to X then x- tends to X'^, x -\- C tends to X -\- C (where 
C is a constant), and C’.v tends to CX. Also i .v tends to ijX pnnided 

N 
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that X is not zero — a reasonable condition, for \\X is not defined when 
X = o. It is always true that antilog x antilog Xy sin x sin Xy 
gx provided that X is positive it is true that ‘s/x ^/X and 

log x log X. This again is reasonable since 's/X and log X are only 
defined for positive X. It is reasonable and true that if y and z are 
functions of x, and y Y and s’ —> Z as a* -> X, then y z Y Z 
andjyxr YZ. Provided that Z o it is also true that yjz -> YjZ. 

If j is a function of x and zv a function of y, and if ^ Y as x X 
and w —> W as y Y then it is true that zv W as x -i’ X. For 
example, x might stand for the age of a child, y for its height (which can 
be considered as a function of a) and zv for its weight (which can be con¬ 
sidered as a function of and so as a function of x). 

There is one small caution. It is tempting to suppose that if y is 
given as a function of x, say y = F{x), and i( y Y as x Xy then 
Y must be the value of y corresponding to x = X, i.e. Y — F{X)y pro¬ 
vided that F(X) is defined. We have seen that this is true if the relation 
is y = x^y or y = sin Xy or y = antilog x. But occasionally it is untrue. 
Suppose that x is the weight of a letter, in ounces, and y the postage 
payable in pence. Then if a- = 3, jy = 3; if a = 2*i, ^ = 3; if a = 
2 00i,jy = 3; for any weight exceeding 2 ounces by however little, jy is 3. 
It follows that if A > 2 but .a -> 2, ^ 3 as a limit. But if the weight is 

exactly 2 ounces the correct postage is 2^d. and not 3d. 

The reason for the discrepancy is a sudden jump in the postage rate 
at 2 ounces. (In case the reader feels inclined to argue that this is an 
artificial example we would remind him that such discontinuities do 
occur in nature: at the surface of a solid body the density seems to 
change suddenly, at least on an everj’day scale of measurement. Seen 
from a molecular point of view the situation is no doubt different, but it 
is scarcely simplified.) 

Fortunately such discontinuities rarely interfere with calculations 
and, in brief, “limits” (in the mathematical sense of the word) behave 
exactly as one would like them to. 


8.7 Differentiation of logarithmic and trigonometric functions 

We can iu)w find the rate of change of logarithms, antilogarithms, 
cosines and sines. 

Consider the relation y = log t (using any system of logarithms). 
As usual Ictyi log/i.ys = log ^y =>'2 —y'u §/ = ^2 “ Then 


5 y == log — log 
= log — 

- - 

- log (i + 5 / 7 ,) 
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whence the average rate of change is 


8y ^ log (i + 
Bt Bt 


_i_ log^(i + Bt/tj) 
'Btjt, 


Now we know that when x is small log (i + .y) is approximately Mx, 
where M is the modulus of the system of logarithms. In fact we know 
that log (i -f x)/x -^M as a: -> o (equation 6.29). Let us put x = Btjt ^; 


then as S/ ^ o, so must x 


o (keepings fixed). Thus 

b///, 


-> M, and so, as Bt o, the average rate of change v —> A///, = F, the 
instantaneous rate of change. As usual we can now drop the suffix j 
and put 

Di log t = Mjt .... (8.6) 


In the case of common logs M = •4343. For natural logarithms A/ r, 
and the formula takes the simple form 


Dt\n t = ijt . . . . (8.7) 


For the antilogarithm function y = antilog t we have, using the 
same notation 

hy =y^~y^ 

= antilog ti — antilog 
= antilog (^1 + Bt) — antilog t^ 

= antilog t^ . antilog Bt — antilog t^ 

= antilog t^ . (antilog Bt — i) 


V = ByjBt = antilog t^ . (antilog 8/ — i)/8/. 

Now as Bt o the expression (antilog Bt — I)/S^ tends to A/“* (from 
equation 6.28), so that v tends to antilog fj. Dropping the suffix 
we obtain 

Di antilog t = A/“^ antilog t . . (8.8) 


In particular for the natural antilogarithm or exponential function, 
exp t = 

. . . . (8.9) 


This function is its own derivative. 

These results can be explained geometrically. On the equiangular 
spiral defining the system of logarithms, with angle let O be the 
centre, and /, P,, and Pg points with polar co-ordinates {1, o}, 
{r„ 0 ^}, and {rg, 0 -^} respectively. Draw the arc PiZ of a circle 
with centre O to meet OP^ in ^ (Fig. 8.4). 1 hen 8r — rn ^ r^ ~ 
OP^ - OP, = OP2 - OZ = ZP2, and BO - 0 ^~ 0 ,^ ^ lOP^ - 

^ /OP, = z. P,OP2. 
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The triangle PyZP^ has angles of 90° — 90®, and ^ at 2', and P* 

respectively, and the length of the arc PxZ is Now when ^8 

becomes very small (and therefore so also does 8r) the sides of the tri¬ 
angle PxZPz become very nearly straight, and so the ratio PiZjZP^ 
tends to tan <f> in the limit. That is, tan <f>. On dividing 

both sides of this equation by Hr^y and dropping the suffix from as 
usual, we see that S0/Sr-s-(tan = Af/r, and so also hrjhQ^rjM. 

Since 8 = log r, and r = antilog 0, this means that Dr log r = Mjry 
Dq antilog 8 = (antilog d)IM. 


P. 



Fig. 8.4—DifFerentiation of logarithms and antUogarithms 

'i'o differentiate the relation v = cos t we can proceed as follows. 
Jo cos fo = cos {tx 4- §0 -- cos . cos ht — siii /, . sin 8 t by the 
addition law for cosines (5.11). Now when 8 t is small wc know that 
cos St is very nearly equal to 1, and sin St to H. St, so that this reduces 
to Jo ~ cos ?j — // sin ti . St. By definition ji = cos fi, and so 

Sy ~ y.^ — J1 — — sin f j . Sf 

and 

Syl St 1=^ — H sin ti. 

In the limit as St - • o we have 

Z>, cos / = — //sin t . . . (8.10) 

'This holds whatever units t is measured in, provided that we insert the 
corresponding value of H. But in the calculus it is customary to measure 
angles in radians, and then H — i, so that 

D, cos / = — sin t (for radian measure) . . (8.11) 

Proceeding in the same way for sines we have, given that j = sin t, 

Jo = sin (/, + 8^) = sin . cos St + cos f, . sin St 
sin r // cos t^ . St when St is small; 



RATES OF CHANGE 


l8l 

>»i = sin ti, and so ~ yz — yi — ^ cos f| . 8 t 
ByjBt cs: H cos ti; i.e. on dropping the suffix from t^, 

Dt sin t = H cos / 

= cos t (for radian measure) . (8.12) 

An alternative proof of the formula Dt cos t = H sin t runs as 

follows. We have ^ i + ^i) — i (^2 — ^) = i (^2 + ^i) — 

and ^2 = i (^2 + ^1) + M^2 ““ ^1) = J (^2 + ^1) + i Sf. Therefore by 
the addition law, \i y = cos t, 

y^ = cos ti = cos i (^2 + ^1) • cos ^ 5 / — sin ^ (^3 -f ^i) • sin \ 8 t 

jVi = cos = cos \ (/« + /j) . cos i 4- sin J (/« + ^i) • sin ^ 5 / 

and on subtraction 

By = y i — j, = —2 sin ^ (^2 “f" ^1) • sin 4 ^/. 

ByfBt = — sin i {t^ + fj) . sin (iSr)/(iS/) 

Now as 8t o, ti ^ ti = /, sin i (ti 4- /■,) —> sin t and sin (iS/)/(iS/) 
“>• H, so that ByjBt -> — H sin t = D,y. 


PROBLEMS 

(1) By a similar method prove that D, sin t = H cos t. Can you 
see any advantage of this second method of proof as compared with 
the first? 

(2) Using the fact that cos d and sin 6 are the x and y co-ordinates 
of the point P with polar co-ordinates {i, 6 ) on the circle r = i, 
show graphically that Dq cos 0 — —H sin 0 and Dq sin 6 = H cos 6 . 

8.8 Derivative of a sum 

Quite complicated expressions, even those such as [/' 4- V^]~ or 
[i{log (i 4- can be differentiated by means of the direct 

approach of finding the average velocity By/Bt and taking the limit as 
^t -*■ o. However, there is a simpler approach. We can think of such a 
complicated expression as built up by a series of steps, those of addition, 
multiplication, taking logarithms, and so on. An elaborate piece of 
machinery may be difficult to understand at first. But by decomposing 
it into its constituent rods, wheels, levers, etc., the whole will become 
clear. In the same way, by taking any expression step by step we can 
differentiate it, provided that we know how to deal with each step. 

The first comliinatirni we consider is the sum. Let y and z be two 
variable quantities, each of which is a function of /. Let w be y | z. 
What is the rate of change oi w} 

Suppose for example that two rivers meet. Let be the total quan¬ 
tity of water which has flowed down the first river to the junction, since, 
Bay, noon, and z the quantity which has flowed down the second river. 
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then tv will be the flow down the combined river. If t denotes time, 
then Dty is the rate of flow do^vn the first river, say 10 cubic metres per 
second, DfZ the rate of flow down the second river, say 20 metres®/sec, 
and DtW the combined flow, which clearly enough is 30 = 10 -{- 20 
metres®/sec. Thus 

Dtw ~ Dt(y + = ^ty + . . (8.13) 

i.e. the derivative of a sum is the sum of the derivatives. 

Formally, let jVi, and Wj be the values of^, Zy and tv respectively 
at time /j, and jVg, z^i and zwg the corresponding values at time Then 
w-^—yi + Zi, tV2 = yz ^2> so by subtraction 

Sw — W2 — tVi = (y2 — yi) + (^2 — -s^i) =83^ + Sz. 

On dividing by Bt we have the relation between average velocities 

Btv/Bt— By/St + BzjBt. 

On letting Bt o, 

DtW = Dty + DfZ. 

The formula clearly generalizes to any sum: 

■Df (>+ ^ +« + •*•) = ^ty + + • • • 

EXAMPLES 

(1) The rate of change of ? is i, that of t'^ is 2,t. Therefore 

D, (/ + ^ 1 _|_ 2/. 

(2) Dt {r- + sin i + cos t) = 2t H cos t — H sin t. 

(3) Since any constant K has zero rate of change, Dt {y K) = 
Z),j, i.e. (j [ A.') and y always have the same derivative. 

8,9 Derivative of a product 

Consider now the j:)roduct w = yz where y and z are variable quan¬ 
tities. T'o make the matter more specific, suppose that OYPZ is a 
rectangle of sides O V' = y and OZ — z. Then its area is = yz. If 
the sides v and ar are changing, what is the rate of change of w? 

Let be the rectangle at time and OY2P2Z2 its new 

shape at time /o. Then OF, = y,, OY2 ~ >’2, and so Sy = y2 — yi 
-- Lii\. (Fig. 8.5). Similarly Bz — Z^Z^- Now Sw = eug — = 

the area F, Y2Z.^Z 'Fhis area can be split into two parts by 
producing to meet FoPo iri Q. 'Fhcn the rectangle 

area ir,Sy and Z^OP2 has area y ^ 5ar: i.e. Bw — ar^Sy + y 2 Sa. On 
dividing by 0/ we obtain 

Bzv'Bt = JSj , BylBt -r V-*. BzJBt 
which is the relation between average rates of change. 
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0 y. n rfy y. 

Fig. 8.5—The change in the area of a rectangle due to changes in its sides 

We can also derive this algebraically: 

hw — W2 — 

= 72^2 —yi^i 

= (>'2^2 —>'2-1) - 1 - — ji-i) 

= >' 2(-^2 — - l) + -1 (j 2 — Jl) 

= ^2 ^y 

and hwjht = Zi ByjBt +>’3 BzjBt. 

Now as 8f ->• Oy -> >'1 = y* dropping the sufHx, and z^ = z, so 
that 

Dfto = Dt{yz) ^ z Dt}’+ y Z>t^ • • ( 8 -^ 4 ) 

In words, the derivative of the product of tu-o quantities is the sum of the 
second times the derivative of the first plus the first times the derivative of 
the second. 

EXAMPLES 

(1) Differentiate t In t 

D, (t \nt)-=\nt . D.t i t .Dflnt 

' In f 4 - I. 

(2) Z), [(i + t) sin = sin / . D, (i + /) r (i 4 0 • ^ 

sin t 4- H cos t. 

(3) Prove, using the product rule, that Dt t~ 2/, D, t'^ = 3/^. 

/>, (2 D, (//) =--- tDft 4 - tD,t - / 4 t 2/ 

D^t^ = /),(//2) 4 tD^t^ - t^\t. 2 t 3/". 
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(4) What is the derivative of Ky^ where is a constant ? 

We know that DiK = o; so putting K in place of z in (8.14). 

Di{^Ky') = KD^y .... (8.15) 

i.e. the derivative of a constant multiple of a quantity is the constant times 
its derivative. 


(5) Differentiate 5^^. 

D, iSt^) = sDd^ = 5 . 3^2 = 15^2, 

(6) Differentiate 3 + 4^ + t^‘ 

(3 + 4^ + ^") = + ^((40 "h ^tt“ 

= 0+4 

= 4 + 2f. 

* 

It is a little more difficult to generalize the product rule to the pro¬ 
duct of 3 or more variables. The simplest way is perhaps to divide 
(8.14) through by w ~ yz; we obtain 

Dtzo _ Dty ^ DtZ 

zv y z 


Now suppose that z in turn is the product of two variables u and x, so 
that z = ux, iv = yz = yux. Then we must have 




u 




X 


or on substitution in the former equation 

Ifzv ^ Dt O’».v) ^ I^y ^ DfU ^ DfX 
tv yux y u ' .v 

This can be similarly e.xtended to the product of any number of vari¬ 
ables, e.g. 

D,(v».vs) DtV , D,u , DtX , D,s 

- 

yuxs y u x s 

For 3 variables we find, on multiplying (8.16) throughout by yi/.v, 

Dtiyus) ^ uxDty -}- yxDtU -• uyD,x. 

In the same wav 

wt 

Di{yiixs) = uxsDty + yxsDfU + yusD^x + yuxDtS 



FURTHER EXAMPLE 

(7) Find the rate of change of a' = / sin / . cos t. 

Ditv = sin ? . cos / . Dft + f cos / . D, sin t -h t sin t . Dt cos t 
— sin ^. cos / 4- H t (cos /)- ~ H t (sin if 
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PROBLEMS 

Find the rates of change (with respect to t) of the following expres¬ 
sions: 


(1) 2 -h 6^2 + 7/3 

(2) (i + 2 t)Vt 

(3) sin / 

(4) 2 ^ . In ^ . sin t 

(5) Vt . In t. 


8,10 Derivative of a quotient 

If y, z are functions of what is the rate of change of w = yjz} 
At time we have ~ y^!^\^ ‘itid at time ^2 = ytl^2- So 

8 tv ^ w. ~ zv. ^ vJz. — yJz, 


= y 




(by reduction to the common denominator 


= y-?^jjz y. >_+ y±^ ~y 

Z1Z2 

_ Zjhy ~ y ^Sz 

And so on division by 6 t we find the formula for the average rate of 
change 

On letting ht o, z.> - ' z, 
rate of change 

/),zv D( 




z, V| V \ve obtain the instantaneous 


o 


z/),y — yDt!^ 
^2 


• (8.17) 


Given the derivatives of sin / and cos t this enables us to find those 
of the other trigonometric fimciions. For example tan t -- sin t/cos i, 
80 that 


D, tan / - 


cos / . D, sin / sin t . I)i cos t 


(cos 0" 

_ H cos t .cos t -I- IJ sin / . sin t 

(cos ty 

= ///(C<JS if 

[since (cos /)- H- (sin ty i] 

-V/(sec ty 
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EXAMPLES 

(i) Find the rate of change oiw — In ^/(i + <), 

We first use the rule for a quotient, treating zo as yjz where y = In t 
and sr — I + /. Thus 

O „ ^ -yPtz 


_(i -ft) Di In t) — In t. Dt (i -f t) 

Now to find Dt In t) we use the product rule 

Dt {t- In t) = In t . Dt t- + Dt In t 

= \nt . 2 t -\- tyt 
= 2t In t -f t. 


To find Dt {i + t) we use the sum rule, 

Dt{i -f /) = Z>( I -f Z), t = o + 1 — 1 

Thus finally 

(i + i) (2t In f -f t) — t* In # 

_ 2t In t + 2t- In t + t H- t= — In t 


2t In t 4- t“ In t -f t -f- t’ 

t (2 + 0 In t -f t (l -ft) 
(I + 0 = 


(2) Find the rate of change of z~^ 


ijz, given the rate of change of 


Put y = I in the quotient formula, then Dt}' = o 

Dt{i!z)= -{Dtz)lz^- . . . (8.18) 

(3) Difi'erentiatc sec t. 

\W* know that sec t = i/cos t; so putting z = cos t in (8.18) we have 
Dt sec i = —H (cos t)“- (—sin t) = H sec t . tan t. 

(4) Differentiate e~‘. 

e~* = I/e^ so putting z = in (8.18) 

Dte-*= -(Dte^)l{er- 
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(5) Differentiate sinh t and cosh t. 

By definition 

sinh f i (e‘ — e~*) 

Df sinh t = ^ Dt{e* — e~^) 

= 

= \ {e^ = cosh t. 

cosh t = i (e* + e~^) 

Dt cosh t = ^ Di (e* + e“*) 

= ^ (e^ — e~‘) = sinh t. 

(Compare with the formulas Dt sin t — cos /, D( cos t 
t is measured in radians.) 

PROBLEMS 

Prove that 

(1) Dt cosec t = —H cot t . cosec t 

(2) Dt cosech t = —coth t. cosech t 

(3) Dt cot t = —/f (cosec t)'^ 

(4) Dt coth t = —(cosech ty 

(5) Dt tanh t = tanh t. sech t. 

(6) By using the fact that = i/?'*, prove that 

Dt = (-H - -■ 

(7) Differentiate with respect to t the expressions 1/(1 -f- 
+ t), i/In t, tint — t, (/ In / ' 

8 «xx Functions of functions 

Suppose we have the following situation. A growing child has 
height X cm at age t years, x is then a function of /, and can be repre¬ 
sented graphically. Suppose further that we plot its weight tv kilograms 
against its height x. 'rhen at a given age these graphs may show that 
the height x is increasing at the rate of 5 cm per year, and further that 
w is increasing at the rate of -7 kilogram per centimetre iticrease in 
height. The weight w can also he considered as a function of the time t: 
can wc say how rapidly it is increasing? 

The obvious argument is to say that since the height increases by 
5 cm in a year, the weight (counted at *7 kg for each centimetre increase 
in height) must increase by 5 X 7 kg = 3 5 kg (per year). In other 
words, since the rate of increase in height can be written D,x, and the 
rate of increase in weight with respect to height is Dyw, we should expect 
that 

Dttv - DtX.D^w . . . (8.19) 

But our argument onl\' proves this for the average rates over a year. 
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= —sin /, where 
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However, we can readily show that the formula holds for momentary 
rates of increase. If in the interval of time 8/ the height x increases by 
8 x and the weight w by then 

8 to 8a; 8 w 

8 t 8 t * Sa; 

identically, as follows on cancelling Sa;. Now as 8/^ o, Sw/St and 
8 x 18 t tend to their respective limits DtW and DtX. Also as Si o, it 
is also true that 8 a; ^ o and so Sw/ 8 a; Accordingly on taking 

limits as 8i o we obtain equation (8.19). 

This method of argument is perf^ectly general, and applies to any 
function x of t, and any function zc of a;, whatever the letters t, x, and w 
may represent. It can be stated in words as: for 3 variables^ of which the 
first is a function of the second and the second a function of the thirds the 
derivative of the first with respect to the third is the product of the derivative 
of the first zoith respect to the second times the derivative of the second with 
respect to the third. 

In the alternative notation which is in common use, in which D^x 
is written as dxjdt, as if it was a quotient, the formula becomes 

dw dx dw 

dt dt dx 

'rhis form of notation has accordingly the merit of suggesting the 
correct answer — although strictly speaking dwjdt is not itself a quotient 
but only the limit of a quotient. But we shall see that this simple form 
is no longer valid when we come to consider partial differentiation. 

EXAMPLES OF THE USE OF EQUATION (8.I9) 

(1) To differentiate w = sin\/?- 

'I’he device used here is to put x = \/t, so that to — sin a*. Since 
.v = have DfX = —Since zv = sin a*, we have = 

H cos A* = H cos \'t. 

Thus D,w — DfX . D^w 

= { — ijlyt) . H COS\''t 
~ —//(cos \ 't);Z\ t 

(2) Differentiate zv - In (i -: /) with respect to t. '^fhe necessary 
substitution here is to put i -r- / = a', so that w — In x. Then 

D,x - /), (i i 

Dyw — /)j, In A- = I /a* = 1/(1 -{- t) 

Sometimes we may need two intermediate steps to perform the 
differentiation. Suppose that x is a function of /, v a function of a;, and 
zc a ftmetion of v. ’I'hen 
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hw _ Sx Sy 

Bt Bt' Bx ' Sjy 

identically. Therefore on taking the limit as Bt o, 

D,w = DiX . Dj-y . DyW 

dw _ dx dy dw 

dt dt ' dx' dy 

as it may alternatively be written. For example, suppose we wish to 
differentiate [ln(i + with respect to t. I’he necessary substitutions 
are 

X = I 4- DtX = I 

= In (i + i) = In x\ D^y = x-^ 
w = [In (i -f t)Y = y^', =-^ 

so that 

DtW = DtX . D^y • DyW 

= 2 x-^y = 2 (i + 0“^ + 0- 

This method can be applied to cases where there is a relation be¬ 
tween variables which is not expressed explicitly in the form is a 
given factor of t*\ 


FURTHER EXAMPLES 

(3) A ladder of length L is leaning against a wall, the foot of the 
ladder being at distance x from the f<jot of the wall, and the top of the 
ladder being at height y (Fig. 8.6). If the foot of the ladder is pulled 
away from the wall with velocity v, i.e. D,x = v, at what rate D,y does 
the top of the ladder descend? 



Fig. 8.6—Moving ladder problem 
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By P)rthagoras’s theorem, 

Differentiate this equation on both sides with respect to t: the new 

equation we obtain must be true, since the equation 

remains true as x and y vary. Now = DfX . = -zxv^ and 

= zyDty. Also DjL- = o, since is constant. 

We have therefore 

2XV + 2yDty = o 

i.e. D,y = — xvfy. 

The minus sign shows that (as expected) y is decreasing when x is 
increasing, and when v is positive. Since y = — x^) this can also 

be written as 

D^y = — Xvl'\/{L^ — x'^) 

If we found that L = 2-6 metres, and v = 1 metre per second, then 
at the moment when :v = i metre, y = 2*4 metres, and Dty = 

= *42 metres per second. 

(4) Differentiate _y = sin~^ t with respect to t. 

The equation V = sin“^ t means by definition the same as ^ = sinjy. 
On differentiating both sides of this equation with respect to t we 
obtain 

Dft = Dt sin jy 

= Diy . Dy sin 3; 

= H cos y . Dty. 

But Dit I, and therefore Dty — i/i/ cos y. And since sin y = #, it 
follows that cos = ±\/(i — /-). Thus 

Dt sin“V = i/// cos sin“*/ 

"" - • (8-20) 

(5) Differentiate y = tan'* t. Since tan y = /, and Z)„tan y == 
(sec j’)-, we obtain by differentiation 

H (sec y)- Dty = Dft = i, 

so that DiV = I,'// (sec y)-. Now (see _v)“ = i + (tan y)^ = i + 
and therefore 

Df tan-* / - 


//(I +/•-) 

(6) Differentiate y = f' for general (not necessarily integral) values 

of 71 . 

By definition v = c"'"* or 

In V -- 77 . In t. 
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On differentiating with respect to t 

jy-i Dty = nt-^ 

and on multiplying both sides by y 

Diy = nyt~^ = = 7iV'-^. 

(7) Differentiate >»■ with respect to /, where y and z are given func¬ 
tions of t. Let tv — y^, then In = 0 In y. Hence by the product rule 

Dt\n tv = In y . DiZ -{- ar Z), In y 
w* Dfiv = Injy . DtZ + zy~^ Dty 

Multiplying through by tv = y~ 

Dty- = y^ (Injv . + zy-^ D,y). 

As a particular case let y = K, a constant, and z = t. Then 

* 

U,y = o, DtZ = I and 
Dt = K* In K. 


Table 8.1—Standard forms 


Function tv 

Ocrivativc Dtw 

Function ft* 

^ Derivative Dttc 

y z 

D,y + DtZ 

cosh t 

sinh t 

Ky 

KDty 

sinh t 

' cosh / 

yz 

z Dty + yDtZ 

tanh t 

(scch t)'^ 

ylz 

z-\zDty — yDt^ 

coth t 

— (coscch 

y" 

y\zy~^D,y +\ny.DtZ^ 

scch t 

— scch t . tanh t 

w = F(y) 

Dty . Dytv 

coscch t 
cos“‘ t 

, —coscch t . coth t 

L [i — 

K 

0 

sin“* t 

[l _ /2]-l/2 

r 


cosh“‘ t 

^ [fi _ ,]-J /2 

log Kt 

Mjt 

sinh"’ t 

[ti + l]-'/2 

In Kt 

ijt 

tan"’ t \ 

I/[I 1 f^] 

anti log t 

M~^ antilog t 

cot"’ t 

--l/[l + f^] 

e* 

e* 

tanh"’ t 

l/[l -- /2] 

cos t 
sin t 
tan t 

cot t 

sec / 

— sin / 
cos t 
(sec ty 
—(cosec t)'^ 
sec t . tan / 

coth"’ t 

1 

1 

1 / I - F] 

coscc t 

—coscc t . cot t 




I 
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8.12 Standard forms 

We are now in a position to find the rate of change of any function 
which can be expressed explicitly or implicitly in terms of the oper¬ 
ations of addition, subtraction, multiplication, division, the taking of 
powers and roots, logarithms, antilogarithms, trigonometric or hyper¬ 
bolic functions, or any combination whatever of these operations and 
functions. It is useful to have a table of “standard forms” summar¬ 
izing the rates of change of all the common functions. In Table 8.1 
(p. 191) all differentiations are with respect to t. The letters K and n 
denote constants. NI = log e is the modulus of common logarithms 
“ '4343 ♦ • • • For all trigonometric functions, sin /, cos etc., t is 
measured in radians: this eliminates the constant H (which now be¬ 
comes i). y and xr represent arbitrary functions of t. 

It is scarcely necessary to remind the reader that skill in differentia¬ 
tion, as in any other activity, comes only with practice. In particular the 
derivatives of Z”, e\ log t. In t, and the trigonometric functions should 
be known by heart. 


PROBLEMS 

(1) Differentiate zv = {2 -\- t)- with respect to t in the following 
ways: 

(i) multiply the expression out in full and differentiate term by term 

according to the sum rule; 

(ii) write zv = (2 4 t) (2 -i- /) and differentiate by the product 

rule; 

(iii) write .v — 2 + / and differentiate by the function-of-a-function 

rule; 

(iv) use the inverse relation / = 4- y'zo — 2. 

(2) .4 man is lifting a kilogram weight by raising the forearm only, 
keeping the elbow fixed. If the angle 0 between the forearm and the 
vertical is decreasing at the rate of i radian per second, and the length 
of the forearm is 30 cm, at what rate is he doing work? (in kg 
weight . mctre/scc). 

(3) I'hc velocity of sound in air is = 66-3 vXT + 273) feet per 
second, where T is the Centigrade temperature. What is the rate of rise 
of velocity per degree Centigrade at 10° C? 

(4) II a hemispherical vessel of radius r cm contains water to a depth 
of .V cm, then the volume of water contained is V= n {rx- — cc. 
Water enters such a vessel of radius 3 cm at the rate of 2 cc/sec. Find 
the rate at which the level is rising, expressing it in terms of the depth 
AT. Draw a graph of this rate against .v, and show that the rise is veiy' 
rapid at first and afterwards slows down. 
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(5) The Schiitz-Borisoff law with regard to the action of enzymes 
such as pepsin and rennin is expressed by the formula .v = K\/{Fat), 
where F is the concentration of enzyme, a the initial concentration of the 
substrate (e.g. albumen or milk), t the time elapsed, x the amount trans¬ 
formed, and K a constant. What is the rate at wliich the transformation 
is taking place ? 

(6) A ball thrown upwards with initial velocity u metres per second 

has height 3; — ut — ^ metres after t seconds. What is its velocity 

at that time ? 

(7) A man is walking at the rate of 2 metres per second towards a 
camera of focal length 16 centimetres. When he is 10 nietres distant 
from the camera at what rate must the ground-glass screen be racked out 
to keep him in proper focus? 

(8) A galvanometer mirror is i metre distant from the scale, and 
the spot of light is moving along it with velocity 15 cm per second. 
When it is deflected 17 cm what is the angular velocity of the beam of 
light, and what is the angular velocity of rotation of the mirror? 


^•*3 Warning—how not to differentiate! 

Suppose we are given the following question. A body thrown up¬ 
ward with velocity 6 metrcs/sec has at time t a height jk’ —- 6/ — 4-905/-. 
What is its velocity after i second ? 

If we differentiate y we obtain the velocity 

V = D,y = 6 — 9'8i/ 

Substituting / = i we find v = —3-81 metres/sec, the minus sign 
indicating a downward movement. 'I'his is the correct answer. 

We might be tempted to argue thus: when / 1, the formula jy = 

6/ — 4-905/^ gives the height y = 1095 metres. 'I’lierefore the velocity 
V, being the derivative of y, is D,y = />,( i 095) - o, .since the derivative 
of a constant is zero. This is evidently not quite accurate; such an argu¬ 
ment would prove that the velocity at any instant was zero, and the ball 
would not be moving at all. 

In the relation y = (it — 4-905/'^ the symbols y and t are variables 
m the first place, and the algebraic relation is equivalent to the whole 
graph of plotted against /, or to a cine film showing the flight of the 
ball. If we put / = 1 we reduce the variables to fixed values, or concen¬ 
trate our attention on a single point on the graph, or as it were lake only 
a rapid snapshot. And we can never find a velocity by considering the 
position y for one instant alone. 

'I’he moral is that when finding a rate of change at a particular 
moment, we must differentiate yirj/ and sub.stitule the particular value 
we are interested in afterwards. We can only differentiate variables, 
and once a particular value has been specified our quantities are no 
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longer variables. (A simple trap—but one which can now and then 
catch the best of us unawares.) 

What notation may we use for “the value of D^y when t— i** ? We 
cannot write this as Z)^ *’095 even though jy has the particular value 
I *095 when / = I . The usual device is to write = a or {dyfdt)i — j 

with a suffix indicating the particular value to /. A more elegant method 
is to use the functional notation y — Fify. We then write the derivative 
as V = F\t). F’{t) is called the “derived function of repre¬ 

sents the velocity v expressed as a function of the time t. We can there¬ 
fore write the velocity at time i = i as velocity at time t 2 

as F'(2), and so on. 

8.14 Motion in a ciirved path 

So far we have considered motion along a straight line. Now let us 
imagine a point P to be moving in a plane along a curve C. We can 
specify its position at any time t by its cartesian co-ordinates x and y 
which will be functions of t. Now let = (atj, jyj) be its position at 
time <1, and P^ = (.Va, y^ at time (Fig. 8.7). Let L represent the 



distance P has travelled along the curve C: the length of the arc P1P2 
is hL. Then it is natural to say that the instantaneous speed of the 
point P at time is the limit of ZLfht as o, i.e. DtL, and the 

instantaneous direction of motion is along the limit of the chord 
P1P2, i.e. along the tangent to the curve. 

It has already been shown that the tangent makes an angle ^ with 
the .v-axis, where tan 0 = D^-y. Since Dty = DtX . D^y this may 
alternatively be written as 

tan 0 = D,ylDtX . . . (8.21) 

provided that Z)<.v # o. 

To find DiL we proceed as follows. Draw a line through Pi parallel 


RATES OF CHANGE 


195 

to the^-axis meeting one through parallel to the v-axis at Z (Fie 
0.7). Then vs- 

P^Z = X2 — = hx and ZP2 = >'2 ^ >'i = S_y. 

Now when 5 / is small the distances a.v and hy will become small; 
and provided that C is a reasonably smooth curve the arc P,P will 
become very nearly straight. Thus by Pythagoras’s theorem 

{^Ly ^ {hxy + {hyY . . . (8.22) 

smaller St is, the smaller the percentage error in this equation 
Dividing through by (8/)2, * 

(SL/sty - (Sx/sty + (Sy/sty 

Take the limit as St -> o 

(n,Ly = (v,xy + (D,yy 

or 

instantaneous speed/),L = . (8.23) 

Note i. The square of Sx is often written conventionally as Sx- 
We could distinguish between (Sxy and S(x=) simply and consistently 
by using the notation proposed in Section 3.7, writing the first as Sa'F 
and the second as Sx^). ' 

Note 2. If we divide equation (8,22) through by and take the 
limit as Sx —> o we obtain the important equation 

(D^Ly = 1 + (D^yy ^ 

■■ -i (^.yy . . (8.24) 


K X A M E I, E S 

(1) A projectile has height _v lo; — 4 9/^ metres and has covered 

a horizontal distance x 10/ metres at time / seconds. Find its speed 
at time t. ^ 

(8.23) {D,ty - io2 + (10 - 9 80 " 

— 200 — 19-6/ I 96-04/' 

speed - DiL V(20o - 19-6/ -|- 96 04/2) ni/sec. 

(2) A point is moving along the circle .v'-* H 1 with uniform 

speed V. What arc the components of velocity 79 ,.v and D,y} 

On differentiating the relation x- y'^ i with respect to / we 
obtain 2xD,x 1 2y/),y ■■■ o, i.e. either y o and /J,x - o or else 
^ty = —xD,xly. hut 

r2 (D,xy 4 (D,yy 

- (D,xy i- a; 2 (/9,.v)2/y2 (if-y o) 

= {D,xy (y-2 4 

(D,x)'/y'^ since ^2 -f-i. 

Pherefore (D,xy yH''‘. If y - o this still holds, since then Dix = o. 
Thus in any case DfX — i y^'- Similarly D,y -y xv. 
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PROBLEMS 

(1) A body is moving according to the law x — zt^y = — In t. 

What is its velocity at any given time ? 

(2) A body is moving with constant speed v along the parabola 
y = x^. Find DiX and D^y. 

(3) Show that if the position of a point P is given in polar co-ordin¬ 
ates as {r, 6 }, 6 being measured in radians, then its speed DiL at any 
time is given by 

{D,Ly = {D,ry -h {rD,ey- 

(Hint: draw a diagram showing small changes in r and 6 .) 

(4) A point P in space can be specified by 3 cartesian co-ordinates 
(jtf, y, 2-). These co-ordinates are the distances of P from 3 fixed mutu¬ 
ally perpendicular planes, such as two walls of a room and the floor. 
Show by repeated application of Pythagoras’s theorem that the speed of 
motion of P along any path C is given by 

{D,Ly = {D,xr- -h (D^yy + 
and that L is the distance gone along this path. 


8.15 The formal definition of a limit 

For the sake of completeness, and absolute accuracy, we now give 
the strict definition of a limit. This is mainly of interest from the 
theoretical and logical point of view, and the reader who is so inclined 
can omit these rather complicated definitions; they are not essential 
to the general understanding of the later chapters. 

Suppose that .v is a variable, and y a function of Let AT be a 
particular value of .v. Suppose we can find a number Y and a function 
^(e) with the following properties: 

(i) </>{€) is defined for all € > o, and, for those values of €, 0(e) > o. 
(The values of 0(c) for c < o are irrelevant to the definition, and 0 (c) 
does not need to be defined for such values of c.) 

(ii) If o < t .V — A' I < 0 (c) then | y — F ( < c. 


'rhen we say that by definition v 


i as .Y ~ A’’, or lim y = Y. 


In ordinary language this definition says that we can makey as near 
to Y as we wish, in fact we can make^v differ from Y by less than c, by 
making .v differ from A' by less than'0{c). The reader may think'the 
definition rather complicated which is why we have not stated it before 
—but it is in fact the simplest possible definition which agrees with 
one’s commonsense idea of a limit. Any other definition either includes 
cases which one would not ordinarily consider as limits, or excludes 
some which are eligible. 
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EXAMPLES 

(1) Suppose j’ is a constant C, show formally thaty- ~> C as .v X. 

Put V = C. Choose any function satisfying (i): the simplest 
choice is ^(e) — 1 for all Then (ii) is automatically satisfied since 
y — y o identically. Thus the limit is proved; we have found the 
required number Y and the required function <i(e) satisfving conditions 
(i)and(ii). 

(2) Suppose j = ^x : show thaty Y ^ 3.Y as .v -> X. 

Choose as function ^(c) Then condition (i) is evidently 

satisfied. To demonstrate (ii) we have 

\y - y\ = - 3^ \ 

= 3(-v-^)| 

= 3 I .Y — a: I [by (4.1)] 

so that if I Y — X \ c ^ c we must have [3' — T I < c. 

(3) If limy' = y, and lim z - Z where v and z arc two functions of 

Y, show that lim {y c) ~ Y + Z. 

X- .Y 

Ky our definition of a limit of y there must be a function ^,( c) which 
is positive for all positive c and which is such that if o < | y ~ A' [ 
< ^i(€) then I 3- — y I < c. Similarly since lim z Z there must be 
a second function > o such that if o < I Y — A I < then 

\ z — Z \ <" £. We wish to show that (3- -I- z) ->■ {Y -\ Z), which 
when written out in full means that there is a fimction such that 

(>) > o if € • o and (ii) If o • | y — A" j • ' 0 ;j(«) it follows that 

(y z) (Y ■} Z) \ ■ (. Now using the theorems of Chapter 4 we 

lave in any case 

1(3- , xr) {V \ Z)\ - 1(3' Y) t (^ - Z)\ 

■ \y - y \ ^\- \ z ~ Z \ . (8.25) 

If we then take ^;i(c), given the value of c to be the least of the two 

numbers ^ e), we see that ^;,(c) is positive since by hypothesis 

^,(A<) and are (lositive. Also if | y - - A | is less than ^;,(€), it 

is less than ’hiHc) and and so, by hypothesis, | 3' — Y \ < ic 

an«l \ z - - Z \ ■" I c. By (8.25) it follows that | (3’ I ~) ( y -j Z) | < e, 

and the result is established. 

'I’he pro»jfs of lurtlier results, such as the limit of a product or a 
quotient, invohe more complicated reasonitig. I'or example, if we 
wish to show under the conditions we have given above (v --> T and 
~ Z as Y - X) that the product 3’xr yZ we use the identity 

I 3 '^ - TA I j ( 3 - - - Y) {z Z) Y{z Z) \- Z {y Y) \ 

l 3 - - y j U - zt -t- I y I U - Z| 4- IZIIy- - y| 
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Now let ^4(€) be the least of the 4 positive numbers ^a(i\/€), 

^2(e/4 I Y I), I ^ I) provided that Y and Z are both different 

from zero. (We leave the reader to work out the modifications needed 
if 5 ^ = o or 2 ” — o.) Then if | x — X \ < ^4(e) we must have \ y — Y\ 

< iV« and I ar — 2 I < so that | j ^ I \ ^ — 2” | < also 

I ^ — 2 I < e /4 I F I, so that | Y \ | ar — 2 | < Je, and \y — Y \ 

< e/4 I 2 |, so that \ Z \ \ y — F | < ^e. From this it follows that 

I yz — F 2 I < f c <: e, so the result is established. To establish for¬ 
mally our theorems that lim sin x = H and lim log (i + x)/x = M 

X-*0 X‘ > 0 

involves much more difficult juggling, and we shall not go further into 
the matter. 


PROBLEMS 

(1) Show that if jy = f(x) can be written in the form y — Y -\- 
^ (x — X), where ^ is bounded (Section 4.4), i.e. | y 5 | is less than some 
fixed number B, then^ ->■ Y as x X. 

(2) If y = f{x)f Y — f{X), and if the ratio 

hyjhx ={y- Y)l{x - X) 

is bounded for all values of x in some range X — k c. x <, X k, 
then y F as .v ->• 2 ^. (Here k stands for a given fixed positive number.) 

(3) If J F as X — > A', and if k is some value of ^( e), then y is 
bounded for all values of .r between X — k and X + k. 

(4) If dyidx exists when .v — A', then j -> F = f{X) as x ^ X. 


We can readily modify the above definition to cover the case of a 
limit of j as .v —> 00. We say that if there is a function ^(c) defined for 
all positive € such that when .v > ^(e) then \y — Y \ < e, then by 
definition jy -> F as a* 00. The proofs of other properties follow for 
this case by suitable modification. 

8.16 Historical note 

The idea of the calculation of rates of change, or “differential 
calculus” is due independently to Newton (1642—1727) and Leibnitz 
(1646-1716): it was also discovered about the same time by the Chinese. 
1 he exact formulation in terms of limits is a nineteenth-century develop¬ 
ment. Leibnitz seems to have thought of the derivative dyjdx as the 
ratio of two infinitely small changes dy and dxi indeed we still often 
use this idea of D^y or dy.'dx being the' value of Sy/Sx when Sx is small 
as a very helpful rough guide, although when we wish to be precise, we 
nowadays say that it is not the actual value but its limit as Sx O. 


CHAPTER NINE 


THE CALCULATION OF SMALL CHANGES 


9.1 Small changes and errors 

The derivative D^w — dwjdx is defined as the limit of Zwjhx as 
8 x o, where Szv and 8 x are changes in the variables zv and x respec¬ 
tively. It follows that when 8 x is very small the ratio Sw/Sx is very 
nearly equal to D^zo. If we are working to a certain degree of approxi¬ 
mation, as in practice we must, then the difference between Dj.zo and 
8 tvl 8 x may be quite inappreciable — or even if it is not absolutely in¬ 
appreciable it may still be too small to be of any importance. In such a 
case 

8 wj 8 x =: D;,W 

or on multiplying through by 8 x 


8 tv ^ . 8 x . . . (9.1) 

If there is a formula expressing w in terms of x it is easy to calculate 
D^w. Equation (9.1) then shows how we can find the effect on etJ of a 
small change 8 x in x. 


EXAMPLES 

(i) A cube of side i centimetre is being used as a unit of volume. 
On a hot day it expands to 1002 centimetres. What is its change in 
volume ? 

Let the side of the cube be x cm, and its volume to cc. Then w — x^, 
and therefore Thus when 8 x is small. If 

X = I and 8 x = 002, then 8 zv, the increase in volume, is 3 x x *002 
= *006 cc. 

{2) An electrical resistance of R ohms is being used for heating. A 
potential difference of V volts is maintained across it by batteries. If as 
the batteries run down the potential difference falls by 8 v volts, what is 
the fall in the rate of production of heat? Assume that by Ohm’s law 
the current / flowing through the resistance is / = VjR amps, and the 
rate W of production of heat is the product of the potential difference 
and current; W = VI = V^jR watts (or V^j^ zR calories per second). 

From this formula DyW = iVjR, and so 8 W ~ 2V . 8 VjR watts 
If for example a voltage drop V = 10 volts is maintained across a 
resistance R of 20 ohms, and V falls by 8 V -= —*2 volts (using a 
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negative sign to indicate a fall), then the power falls by 81 V = 2 X 10 x 
(—•2)/20 = —2 watts. 


This argument can be applied to small experimental errors. Sup¬ 
pose we have a metal cube whose side is measured as x centimetres. 
Then its volume will be calculated to be oj = cc. If the measurement 
of the side is in error by an amount S.v then the volume will be wrong by 
approximately D^tv . Sa: = 3.v*Sa cc. If we measure the side 8x with 
an ordinary ruler with an error of not more than 02 cm, then | Sa ) < 
•02, and I Stt) I ^ | Sa | < ■o6a'-. This shows us the maximum 

error we can expect in the calculated value of the volume. 

It is important to be quite clear what sign to give to an error. It 
seems logical to call the difference (observed value — true value) the 
“error” in the measurement, and the opposite quantity (true value — 
observed value) the “correction for error”; i.e. the correction is what 
must be added to the obser\'ed value to give the true value. In the 
equation 

8 w DjW . 8 x 

we can interpret Sec and 8 x as errors in tv and x respectively. Alter¬ 
natively we can interpret them as corrections for error. It does not 
matter which convention we use, provided that we keep to the same 
convention throughout the calculation. 

Often we are interested not in the actual error but in the propor¬ 
tional or percentage error. The error 8x expressed as a fraction of x 
will be S.v/.v; as a percentage it will be 100 S.v/.v per cent. Now In x 
= i/.v, so that if S.v is small, S In .v £)j. In x . S.v = Sxlx. In other 
words a small proportional error in a measured quantity .v is equal to 
the actual error in its natural logarithm In x. The percentage error is 
obtained by multiplying the proportional error by 100. 

Considering again the example of a cube of side .v and volume w = 
.V®, we have In w — 3 In .v, so that 5 In w = 35 In x. The proportional 
error in the ^■olun■le is 3 times the proportional error in the side. 


PROBLEMS 

(1) An angle 0 is measured in degrees correct to i decimal place, i.e. 
with a maximum error of about •05''. Given the value of 6 w’hat is the 
maximum error in sin 6 and tan d? What is the maximum proportional 
error ? 

(2) A circle has its radiu.s R measured to the nearest tenth milli¬ 
metre. I low- accurate is the calculated area 77/?-.? 

(3) pole measured as 12 metres high is found to throw a shadow 
16 metres long. If the measurement of the height of the pole is too 

small by i per cent, what is the error in the calculated angle of elevation 
(>f the sun ? 
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(4) An error of 'ifx is made in measuring the diameter of a red 
blood cell as 7-5^. Assuming that the cell is spherical, what is the per¬ 
centage error in estimating its volume (as and surface area (as 

477-/? 2 )? 

,(5) The current / amps passing through a tangent galvanometer is 
estimated as / = AT tan 0 , where 6 is the angle of deflection and K is a 
constant of the particular galvanometer used. Assuming that K is 
accurately known, and that the accuracy (expressed as maximum actual 
error) in the measurement of 0 is the same for all values of 6 , what angle 
of deflection will give the minimum percentage error in the calculated 
value of the current? 


9.2 Partial differentiation 

In all the cases discussed so far each variable quantity is a function of 
any other relevant variable. 'I'hus in proving the formula I),w = 
Dty . we considered the example of a growing child, whose height^* 
was a function of the time /, and whose weight w was a function of j'. 
It follows that IV can also be considered as a function of /; and this is 
the most natural way to look at the example. We can think of the change 
in i.e. the child’s growing older, as the cause of the change in y, the 
height, and w, the weight: we can say that3' and w are “dependent” on 
/, i.e. functionally related to t, or determined in value by the value of /, 
while i itself is the “independent variable”. 

This is the commonsense approach. But considered from a purely 
mathematical point of view, in which we are interested only in the 
relationships between the values of t,y, and w, and not in stating which 
is the cause and which the effect, then we can equally well express t and 
as functions of 3', or t and y as functions of w. We do not think of the 
height 3; as “causing” the time / or the weight ze; but to each value of 3* 
there will correspond a value of / and a value of w. (Conceivably there 
may be more than one value of t for a single value of v. But this docs 
not really introduce any difliculty: we know that many functions con¬ 
sidered in mathematics, such as sin-'.v, for example, are not single¬ 
valued.) 'I he important point is that any one variable can be called 
the “independent varial)le”, and all the others will then be “dependent 
variables”, i.e. determined in value when the value of the independent 
variable is known. 

However, when we consider the effects of small changes it is very 
artificial to restrict ourselves to systems in which only one quantity can 
vary independently. 'I'lie result of an experiment may l>e influenced by 
a considerable number of factors. 'Bhe course of a chemical reaction 
will certainly depend on the concentrations of the reacting substances 
and on the temperature. 'The growth of an animal will depend on the 
amounts of the various proteins, vitamins, carbohydrates and fats it 
receives in its diet, and all these are capable of independent variation. 
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In calculating the results of an experiment we generally have to take 
into account not one but many measurements. Each measurement may 
be subject to an error which will affect the final result, and there is no 
way of inferring the error in one measurement from the error in another. 
Thus it is often necessary to consider a quantity ro as a function of 
several others; w = fix, y)^ or w = /(x, y, z). If x, jy, and z are subject 
to small changes 8 x, Sy, and Sz respectively, what is the effect on w ? 

In experimentation there is a rule “study the effect of altering one 
variable at a time**. In any particular experiment this may or may not 
be a sound principle, but it is certainly useful for our problem. Suppose 
w is a function of 3 variables a?, y, and z, all of which can vary indepen¬ 
dently of one another. We begin by keeping y and z fixed and allowing 
only X to vary, w then becomes a function of one variable only, that is, 
a function of x, and so we can then differentiate w to get its derivative 

D^w and apply all the results previously obtained—for example that 

Sw 2^ D^w . Sx. 

In order to emphasize that y and ar are being kept fixed we shall 

denote this derivative by the symbol vj or f and call it the 

\dX J 

“partial derivative of w with respect to .v keepingy and constant*’. The 

phrases “partial derivate” or “partial differential coefficient’’ mean the 

same as partial derivative”. The derivative is, of course, “partial” 

because when y and z are kept fixed the possibilities of variation are 

restricted to only a part of what they might otherwise be. It is to make 

this clear that we use the special letter d. This is simply the Russian 

form of the small letter d (the name of which, in Russian, is practically 

identical in sound with the standard English pronunciation of the word 

dare ), But as a rule the symbol dwj dx is spoken as “partial du> by 
dx'^ or “curly dw by d.v”. / e y j 


EXAMPLES 


^ y'> then = 2, = zy. For when we 

ditterentiate with respect to .y, keepingy constant, the termy^ will have 

a zero derivative. Conversely when x is held fixed the term 2X will 
contribute nothing to the derivative. 


(2) If _ x~ + zxy +y^ 3 , = zx + zy, ==2X-|- 

= 3y2r2. 


PROBLEMS 

Find D^i,w = {etvjdx)^ and = {ewfey), when (i) o) = 

.V In y; (2) a- = .vVy; (3) + y). 


We have shown above that if x alone varies by an amount 5 x, tv will 
change by an amount hto given by the equation 
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Sh) , to) Sx = — ^ ^X. 

In the same way it is possible to investigate the effect of a change in y 
alone, keeping x and z fixed. If we then differentiate to with respect to^* 

we obtain the partial derivative and when Sv is 

small 

Sto (Z>^I^,*to) Sy = 

Similarly if only z varies 

Sto (/)z la-.yw) Sz = Sy 


These formulas show the effect of small changes in any one of the 
three variables separately. Suppose now we want to change all three; 
let us say that x changes from to .*2.7 from yi toy 2, and z from z^ to 
Z2, where all three changes 8.x = ^^2 — a:,, Sy = yg — y^ and Bz = 
sii — Zi are small. Let us write to = f(x, y, z), to, = /(.v,, y„ a:,) and 
^2 “ /(^8.3^2* •3^2)* Then as a result of the changes to is altered in value 
from TOj to TOg. We wish to compute 8 to = TOg — TOj. 

The natural method is to change .v first, then y, and finally z. If x is 
changed from to Xg to will become /(ATg, y,, z^) and the change in to 
is approximately given by the formula 


/(x2.yi,^i)—/(xi,yi,sri)^(D^!y.:W)8x . (9.2) 

where we take the value of the partial derivative when a: = atj, 

y = >’v ^ = ^1- 

This first step is from (.Vj, y,, z^) to (ATg, yi, z^). The second step is 
from (Xi, yi, 2,) to (ATg, yg, x?j): that is, we change y from y, to yg, 
keeping x fixed at the value of g and z fixed at the value z^. We then have 
the change in to 


/(jf 2 . >* 2 . 3r,) —/(A:2,y„5r,)=^(Z>^,,.,TO) Sy . . (9.3) 

where the derivative is now to be calculated for the values 

5 = ^2»3' = >'i.^ = The final step is from (ATg.yg, 2rj) to (.Vg.yg, arg), 
i.e. a change Bz in z from to Z2 keeping a: fixed at ATg andy fixed atyg. 
Accordingly 

/(jfg, yg, Sg) —/(A-g. yg. jTj)--(Z>,|^.^ to) Ssr . . (9.4) 

where (Z), fg.ytv) is to be calculated for x = ATg.y = yg, z = ar,. 

The total change Sto = TOg — to, =f(x2,y2, ^2) — /(^i,3'i. -^i) will 
be the sum of these three separate successive changes, so on adding 
these three equations (9.2), (9.3) and (9.4) we obtain 

Sto ^ (Z)^I„.,to) Sa: + (/),,I^,,to) Sy +(/), |:r.„TO) Bz 


(9-5) 
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Now we have said that in this equation the derivative (Da.\y^^w) is to 
be calculated for the values x = x^y y ~ jy,, z — Zj, the derivative 
(Dyix.3^) for the values x = Xg, y = yi, z = Zi and the derivative 
(^r \z.v^) for the values x = X2,y = y^, z = z^. But in any practical 
example since x^ and X2 are very nearly equal it will not make any 
appreciable difference to the value of the partial derivatives whether 
we take the value when x = .Vj or when x — or whether3? = or 
jy 2 or = Zi or z^. To be more precise, occurs in the formula 

multiplied by Sxy which is a small quantity, so that any small change in 
{D^\y,.zv) when multiplied by 8 x will be quite negligible—the formula is 
only an approximation in any case, though the smaller Bx, By and 5 ^ 
become the closer is the approximation. The same remarks apply to 
the other two derivatives appearing in the formula—it does not matter 
whether we t^e a? to be .r, or ACg, j to bey^ orj-g, and s to be z^ or z^. 
The formula is thus a general one. In the alternative notation it can be 
written ..-.v 

^ (9.6) 

For the purposes of illustration we have taken w to be a function of 
three independent variables .r, y, and z. The same sort of argument will 
apply if we have more or less than three: if zo was a function of two 
variables, x and 3% the appropriate formula for the change in tv produced 
by small changes Bx and By in x and jy respectively would be 

Sk) (D^^ytv) Bx + (Dy^^tv) By 


Ihe formulas (9.5), (9.6) and (9.7) have a quite simple interpretation. 

(9*5) know that Bx is (approximately) the change in zo 

produced by a small change Bx in .v alone, y and z being kept fixed. 
Similarly (Z)„|3. Sy is the change in tv produced by the change By 
acting on its own, and (D- ]r yw) Bz the change produced by Bz on its 
own. Thus the equation (9-6) asserts that the change in w produced by 
simultaneous small changes Bx in ,v, Sy in y, and Sar in z is obtained to a 
sufficient accuracy by adding together the changes which would be 
produced by each of S.v, Sy, and Sxr acting separately. This is a very 
important result. 

It is, of course, essential that the changes S.v, By, and Bz should be 
small. It will not be true in general that the effect of a large change in 
.Y, 3', and z simultaneously can be got by the simple addition of the eff'^cts 
of each of the changes acting on its own, without any change in the 
values of the other variables. 


9*3 Calculation of the effects of several simultaneous changes 

The formulas (9.5), (9.6) and (9.7) given in the previous paragraph 
have a somewhat formidable appearance owing to the large number of 
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suffixes. This appearance is quite deceptive, for they are very easy to 
apply. It is only necessary to remember that the symbol Z),, means 
“differentiate with respect to x, treating y and z as constants”. In doing 
this differentiation we can make use of any or all of the techniques we 
developed in Chapter 8 . 

For example, suppose that we are interested in the area of a rectangle 
of sides, say, y and 5?. What will be the effect of making small changes 
in the sides? We know that the area w = yz. Differentiation with 
respect to y only gives = {dwjdy): = z, differentiation with 

respect to z only, Dt\yV} = {cw}dz)y = y. Therefore 

Bio 2; {Dy\:w) By + {D:\yW) Bz 
= zBy + yBz. 

The same principles apply to errors of measurement. If we have 
measured the side y with a small error Sj>’ and the side z with a small 
error S^r, then the error in the measurement of the area is zBy 

The reader can compare this formula with Fig. 8.5 wltich shows that 
Bzv = Zi By y.^ Bz exactly. If By and Bz are small then we can write 
y in place of y.^ and z in place of Sj; the error in replacing y^ by y when 
multiplied by Bz will give an exceedingly small quantity, quite negdigible 
for practical purposes. "^I hc difference between the former application 
of this result and the present one is that in Chapter 8 we were concerned 
with the case in which the sides y and z were both functions of a single 
variable t, and so were connected by some relation, whereas here we 
suppose thatjy and z can vary independently. 


EXAMPLES 

(i) The volume F of a cylinder of radius R and length L is ttR-L. 
If there is a maximum error of -02 cm in the measurement of R and L 
(in cm) what is the maximum error in the estimate of F? 
Differentiating with respect to R only we have 


Differentiating with respect to L only, 


Therefore 


SF ^ ^ 7 . 7 rRLBR ^ ttR^BL 
7 tR{2LBR \ RBL). 


If I 87 ? I and | BL \ are both less than -02, | 8F | <7r7?( o4 L 4 - 02R). 
For example, if 7? = i, 2 | 8F | < lo^r -31 cc. 

(2) The altitude of the sun is calculated by observing the length, 
I metres, of the shadow cast by a pole of height h metres. If the measure¬ 
ment h has an error hh, and / an error Bl, what is the error in the measure¬ 
ment of the altitude a radians? 
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We have A//= tan a, or a — tan"^ WO- Therefore Dt^^ia — 
Dftl I tan"^W0 = A^/o (A//), Z)a|i WO (hy the function of a function 

rule) = [1 + W0"]“^ • (t/0- Similarly = [i + W0*]“K-W- 
Thus 

8a = [i -h W0"]‘^ (8A//) + [i + WO^-" (-A5///2) 

_ I /^8A hht\ 

I -h W0= W ^ V 

_ fSA — A8/ 

/=* 4- A^ * 

An alternative and simpler solution is as follows. The equation 
A// = tan a can be written logarithmically 

In tan a = In A — In / 
whence 8 In tan a — 8 In A — 8 In/ 

Now considering In tan a as a function of the single variable o, 

8 In tan a ca {Da In tan a) . 8a 

= {Da tan a)/tan a . 8a 
= (sec a)®/tan a . Sa 
= sec a . cosec a . 8a. 

Similarly 8 In A =s; SA/A, and 8 In / ca; 8///. Thus 
sec a . cosec a . Sa = 8A/A — 8///. 

On multiplying both sides of this equation by i/(sec a . cosec a) = 
sin a . cos a we have 

Sa ca- {8A/A — 8///) sin a . cos a. 

Now sin a . cos a = tan a (cos a)* = tan a/(sec a)* 

~ tan a/[i + (tan a)®] 

= (A/o/[i + {hm 

= hii{i^ + h% 

so that Sa == (/SA — AS/)/(/“ 4 A") as before. 


PROBLEMS 

(1) A rectangle has width at and length jy. What is the error in the 
estimation of the diagonal \'{x- + y~) when x and y have small errors? 

(2) The density of a cube of mass m and side s is m/s^. What will 
be the effect of small errors in the measurement of m and s ? 

9.4 The meaning of partial differentiation 

We can illustrate these results in the following way. Suppose that 
zv is any quantity which varies from point to point over a plane. For 
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example, w might be the degree of illumination, or the temperature, or 
the concentration of some substance diffusing through a thin film. At 
each point P, w will have a definite value. If we specify P by its cartesian 
co-ordinates {x, y), then w can be considered as a function of .v and^-. 

Now when we differentiate zo partially with respect to x, keeping y 
fixed, we are in effect limiting P to a horizontal line y = constant (Fig. 
9.1). The derivative Dx\yZO shows how rapidly w varies as P moves 



Fig. 9.x—Partial differentiation in a plane 


along this line. In the same way the derivative Dy^j^w = {dwjdy)^ 
shows how zv varies when P is restricted to move along a vertical line 
X = constant (Fig. 9.1). 

The point P can equally well be specified by its polar co-ordinates 
{r, 0 } and therefore w can be considered as a function of r and 6 . The 
derivative then shows how w varies when P is restricted to the 

curve $ — constant, i.e. a straight line through P running out from the 
origin O, while DQ\rW will give the rate of variation with respect to d 
along r = constant, a circle with O as centre. 

These do not exhaust the possibilities. We could, for example, 
specify the point P by the co-ordinates .v and r. This is an unusual 
combination, but it is certainly a possible one, and might indeed be 
useful in certain special problems. We can then calculate 
meaning the rate of change of w (with respect to x) when P moves 
along a circle r = constant. Comparing this with which is the 

rate of change when P moves along the horizontal line y = constant, 
we see that there is no reason in genera! why and D^fyW should 

be the same: they are the answers to dill'ercnt questions, and the 
movement of the point P is restricted in different ways. 

As an example we may take w to be equal to r'^ --- x^ y‘^. Express¬ 
ing lu in terms of x and r, zv = r^, D^^rW - o. This is natural, since 
when we keep r fixed the value of w is fixed too. On the other hand, 
since to x"^ + y^, Dx\v^ — which is different from except 

when X = o. The moral is that in any partial differentiation it is 
essential to specify exactly what variables are being held constant. 
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An even clearer example occurs with the problem of the energy E 
contained in a mass of gas of volume V at temperature T and pressure 
P. We know that F, T, and P are connected by the gas equation 
PV = RT where is a constant. Thus only two of the three quan¬ 
tities P, V and T can be independent variables, since when two of 
them are given the value of the third can be deduced. The energy E 
can therefore be expressed either as a function of P and P, or else 
as a function of P and P, or finally as a function of T and P, and there 
are no obvious reasons for preferring any one of these to any other. 
Now Dy^pE means the rate of change of the energy as the volume 
changes when the pressure is constant, and Dy\iE the rate of change 
when the temperature is constant, and there is no reason why these 
two rates of change should be equal. Again it is essential to specify 
what is being kept constant when we differentiate. 

Nevertheless there are cases where it is convenient to omit the 
suffix. If Z4J is a function of two cartesian co-ordinates x andy then 
when differentiating with respect to x we shall almost always be inter¬ 
ested in the derivative with y held constant, and when we differentiate 
with respect to x will be fixed. Thus it will usually be sufficiently 
clear to write Dj.w or dwj dx for y w, and Dyto or dy for Dy\ juo. 
Again if we are using polar co-ordinates instead of cartesians, the 
expressions dwj dr and dwjdd will mean {dwj dr)^ and (8wl8d)r as a 
rule. If we say explicitly "'w is a function of three variables x, y, and 
z’* (where .v, _y, and z denote any variables, not necessarily co-ordinates), 
or if we write w = f (.v, y, z), then D^w or dwj dx will always be under¬ 
stood to mean Dj:\y,,w = (dwjdx)y,g, and similarly for DyW and 
DgW. An alternative notation which is frequently used is to write 
w^ or fj. {x, y, for D^w, Wy or/y {x, y, z) for Z)„w, and w. or/- (.t, y, z) 
for DgW. Thus equation (9.5) can be written in any one of the forms 

Szy =- Dj,w . S.v + DyW . Sy -j- D^w. 8 z 

= (5) + (I) + (5) 

= Wj .. S.V ^ Wy. hy -h W; . Sir 

= fx (-V, y, z) S.v -f/y (.V, y, z) 8y +/. (at, y, z) Sz 


all of which have exactly the same meaning. It is important to note 
that Dj:W is, however, only a conventional shorthand form for 
and the reader is advised to insert the subscripts indicating the quan¬ 
tities held constant whenever there is any possibility of doubt. 

For example with this notation it will not be generally true that 


f i/( ^*V whereas for total differentiation, where w and z are 

\dxj l\Or) 

dz \ 
dwj 


any two quantities functionally related, we must have ~ 
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If (.V, jv) are cartesian and {r, 6} polar co-ordinates we have, in fact, 

2 .V 


r = + y% 


X = r cos d. 


c r _ 2 .V _ X 

dx 2 \/{x- + V') r 


dx 

cr 


cos B = 


X 


dr 

dx 


The reason for this apparently paradoxical result is simplv that in 
calculating drjdx we keep fixed, whereas in dxj dr we keep 6 fixed. 
Since we are keeping different quantities fixed we cannot expect the 

ordinary rule — i to hold. If we keep the same quantity fixed 

it will be true, so that 

9-5 Change of variables in partial differentiation 

From the above argument it appears that we neeti a rule which 
will tell us how to change the variables with respect to which we are 
differentiatit^g. Such a rule will take the place of the “function of a 
function” rule in ordinary total difi'erentiation. 

Let us therefore consider a quatuity w winch is a function of three 
independent variables x, y, and z. 'I’he argument will readily generalize 
to cases in which we have less than or more than three independent 
variables. Suppose that we wish to rej'ilace x, and z by three new 
independent variables X, Y, and Z, which are connected with .v, y, 
and z by known relations. ('I'his includes the particular case in which 
We only replace one or more \ariables; we could change .v to a new 
variable X, leaving y and z unaltered. This will be co\ered by the 
general formula by putting Y = y, ^ ij). I low can we now calculate 
the new derivatives 

-inti 

To do this we take the general formula ( 9 . 5 ) 

Sw ^ Sa 4 Sy p dz 

and divide through by 5A', obtaining 

{hwlhX)'-{D,,,_,w){hxihX) + {IK^,,,w){hylhX) 

i (Sc/S.V). 

Now let us keep Y and Z fixed, and let hX lend to zero. 'I'hen by 
definition hwjhX tends to the partial derivative ‘tod similarl\- 

^xjhX —- D^^y.xX, hyjhX i^xw/zy imd hzjSX ^ ^^xw.yy- Also 
the smaller hX is, the smaller S.v, by, and '6z will become, and the 


r 
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nearer the approximate equality will be to exactness. Thus in the limit 

^A'l Y.Z ^ Y,ZX) + {Dy\x.z^){Dx\ Y,zy) 

- • - (9*®) 

This is the required formula. The formulas for the other two 
derivatives follow in exactly the same way: 

{^v]x.e^){^Y\X,zy) 

+ {^s\x.v^){^Y\X.Z^) 

^Z\X.Y^ ” {J^x\v.^^){^Z\X,yX) H“ {Dy\x.a'^){I^Z\X,Yy) 

+ {I^3\x.v‘^){I^Z\X,Y^)- 

The large number of sufiixes makes these formulas look rather 
confusing. If we can safely adopt the convention that Z)* stands for 
I^x\y.z and Dx stands for Dx\y,z then they become rather simpler in 
appearance, 

Dxw = {D^w){Dxx) + (Z)^tf;)(Z)A-3') + (A«')(^a'^) 

or 



cw 

dx 



dX 


dzv dz 

~d z '~dX 


or 


v,\ . Xx + -yx + tv, . Zx. 

In words, “to find the partial derivative of w with respect to a new 
variable A' we multiply each derivative of zv with respect to an old 
variable .v, _v or c by the corresponding derivative of the old variable 
with respect to tlte new one A', and add all the products so formed”. 
When expressed in this way the rule is not difficult to apply. 


EXAMPLES 

(i) w is a function of the position of a point in the plane, and is 
expressed in terms of the cartesian co-ordinates {x, y). Find its partial 
derivatives in terms of polars {r, 0). 

For simplicity let us write zv^ for DJ.^yZv = {dwjdx)y, zVy for 
Dy\jZL\ and so on. Then 

ZV^\\ -i- ZVyVr 
ZV^, = 7C^\\y + ZVyVf, 

Now if 0 is measured in radians, .v -- r cos 0, so that 

.\V — cos 0, .Y^ = — r sin d. 

Similarly y = ;• sin 0, and 

y,-- sin 0, y,f 


r cos 6. 
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Substituting these values in the equations above we finally obtain 


Wf = cos ^ + sin 6 . 

= ~ r sin ^ . a?, + r cos 6 . 


For example, suppose that 
r cos 6, and therefore 


tv = xy, then = y = r sin 0, 



tVr = cos 0 . r sin ^ + sin 0 . r cos 0 
= 2 r cos d . sin 0 
= r sin 2 0 

tVg = — r sin 0 . r sin ^ + r cos ^ . r cos ^ 
= [— (sin 0)^ -}- (cos 

= cos 2 0. 


( 2 ) Metabolism in a bacterium —The rate at which any chemical 
substance can’be absorbed into a bacterium will be proportional to 
Its surface area A. If we suppose that the substance has to be dis¬ 
tributed throughout the whole volume V, the rate at which it can be 
delivered to any particular part will presumably be proportional to AjV 

If we call this rate M, we shall therefore have M = KAjV where K is a 
constant. 

We may be interested in the question of how M is affected by 
changes in the shape and size of the bacterium. For simplicity suppose 
that it is of cylindrical form with length L and radius R with two 
hemispherical caps at the ends (Fig. 9 . 2 ). We shall require the formulas 



Fig. 9.2—Volume and area of a bacterium 

for the areas and volumes of a cylinder and a hemisphere (see Sections 
11.9 and II. 10 ). The volume of a cylinder is equal to the area of its 
cross-section times its length, i.c. in our case nR^L. The volume of 
a hemisphere is IrrR^. The total volume of the bacterium will there- 
fore be + i„R^ = %-„R^ + „R^L = V. The area of a 

hemisphere is 27tR^. 'I’he area of the curved surface of a cylinder is 
z-itRL. (For if it is slit along a line parallel to the axis it can then be 
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Opened out into a rectangle of length L and breadth equal to the 
circumference -zttR of the cylinder.) Thus the total area is -{- 

27tRL -f zttR- = ^.ttR- + 2 itRL — A. We have therefore 

M = KAjV = K UttR^ + 27rRL)l{^7TR^ + ttR^L) 

= 2L)l{iR^ + RL) 

on cancelling out the factor ttR from numerator and denominator. 

To determine what effect changes of R and L have on this we must 
find the partial derivatives Dj^^M and or, as we shall denote 

them for conciseness, Mr and Ml. We can do this in several ways. 
One is to differentiate the expression directly as it stands. Another 
is to use a two-stage process, analogous to the “function-of-a-function” 
rule for the single independent variable. We write n = 4 /? + 2 L, 
V = ^R^ -j- RL so that M = Kujv. We can first differentiate M 
partially with respect to u and and then use the formulas 


Mr = MjiR + M^Vr 

Ml = M^^Ul + M^Vl 


to find the derivatives with respect to R and L. (Here is short¬ 
hand for Z>u| yM and M^. for D^,\^^M.) 

Since M = Ku/v, we have at once M^ == RL)y 

and M., = - A'«/r 2 = - ^(4^ H- 2L)I{%R^~ H- RLf. 

Since u —- 4 /? -f- 2 L, Ur = 4 and Ul = 2 . Since v = %R- -f- RL, 
Vr = flT? + L and i'l = R. Substituting these values in the equations 
we obtain 

4 A' K {m + L){ 4 R + 2 L) 

^ IR^ + RL (S + RLY 

-- ~ 6 K ( 8 i?^ -r SAL + 3L2)/(4A + -^Lf 

and 



2 K K ( 4 A -h 2 L) R 

fR^ -f RL (tR^ + RLY 


= - i2A7(4A + 3^)"- 

If now R changes by a small amount 8R and L by a small amount 
5L the expected change in rate of metabolism A1 will be 8AI — A'IrBR-^ 
.'V// 5L. As both Mr and Ml are negative, this means that we can 
expect an increase in either the length or the radius to slow down the 
process. 'I'his is in accordance with common sense, since an increase 
in size will make the interior to some extent less accessible from the 
.surface. 


( 3 ) Given the internal energy L of a gas as a function of its tem¬ 
perature and volume V, find the rate of change of E with respect to 
the temperature when the pressure P remains constant. 

We shall use the notation Lj-,for DjiiE ~ {dEj dT)y, and similarly 
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for other derivatives. We know £* as a function of T and F, and there¬ 
fore we know Etw We wish to change the independent 

variables from T and V x.o T and P\ the appropriate formula is 

Et\p = Ej-\y Tt\P + Ey\T V-j-^p. 

Now if n is the mass of gas measured in molecular weight units (e.g. 
moles) then the gas law states that 

PV = nRT 


or, expressing T and V in terms of T and P, 

T=T\ V=nRT(P 


whence 


Tt\p — I; Ft'ip — fiR/P 
and, by substitution in the formula, 

Et\p ~ Ex\y -b tiREi-irlP 


Observe that in this formula the suffixes indicating the quantities 
held constant cannot be omitted, for without them there would be no 
way of distinguishing between E-pip and Er\y. 

It is not possible to go any further by purely mathematical reason¬ 
ing. However, there is a classical experiment which shows that if a 
gas enclosed in a thermally insulated container is allowed to expand 
into a vacuum, in the final state when it has again come to rest the 
temperature returns to its original value. 'I’hus it has undergone a 
change in volume without change of temperature: BT = 0 . Now since 
the container is insulated no energy is gained or lost by the gas in the 
form of heat, rurihermorc it docs no work on its surroundings, so 
that there is no change in interijal energy. 'I’hus at constant temperature 
hE = o, so that BE/BV - o. On taking the limit as BV o we have 
Ey^r= o. On substituting this in the above equation we find that 
in fact 

Et\v = Er\p .... ( 9 . 9 ) 

This is, however, a physical law and not a mathematical identity. 

( 4 ) The specific heats of a gas —Suppose that we take a unit mass of 
gas (in any units, not necessarily molecular weight units) and warm it, 
imparting to it ati amount of heat BQ with the result that its tempera¬ 
ture is raised by an amount BT. 'I hcn the ratio BQ/BT may be called 
the “average specific heat of the gas” over this range of temperature, 
since it represents the average amount of heat required to raise unit 
mass through one unit of temperature measured over this range. If 
this average BQjBT tends to a limit c as BT o we shall naturally 
call this limit the actual specific heat at temperature T. It will, however, 
be important to state the conditions under which the heating takes 
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place; and in particular we can distinguish between the specific heat 
Cy when the volume is kept constant, and the specific heat Cp at constant 
pressure. (We can in theory make the same distinction for solids and 
liquids, but in those cases it is usually the specific heat at constant 
pressure that is meant: it would require very unusual conditions to 
keep the volume constant.) 

Now when the volume is kept constant the gas can do no work 
on its surroundings, so that all the heat energy SQ received goes to 
increase the internal energy of the gas. Thus SQjST — 

SE/BT at constant volume, and accordingly in the limit as BT o 

Cy = Ex\y .... (9.10) 

On the other hand, if the gas is kept at constant pressure then as 
the temperature rises there will be an expansion by, say, BV. This will 
do an amount of work PBV against the pressure exerted on the gas by 
its container. Thus the heat energy BQ supplied will go partly towards 
increasing the internal energy by an amount BE and partly to doing 
work PBP\ or in other words 

BQ = BE PBV 

at constant pressure. Now divide this equation through by BT and 
take the limit as S7' -> o; BQ/BT will tend to the specific heat at con¬ 
stant pressure, and 

Cp = Ex,p + PVp^p . . . ( 9 -^ 1 ) 

The equations (9.10) and (9.11) will hold for any substance whether 
gas, liquid, or solid: they amount in practice to definitions of the 
specific heats at constant volume and constant pressure in terms of 
partial derivatives. For a gas we have by (9.9) Et\p = Ep^y = Cy. 
Furthermore it tn is the molecular weight of the gas we are considering, 
then in unit mass there are \ ‘.m molecular weight units, and therefore 
PV' ~ RT m. or I - By differentiation we find Vq'\p = 

R,mP or P\'xyp~R m. Substituting these values in the equation 
(9.11) we obtain 

Cp = Cy 4 R m .... ( 9 . 12 ) 

the classical equation connecting the two specific heats. 

Note. In this equation we suppose heat energy and mechanical 
energy to be measured in the same units. If heat energy is measured 
in calories and mechanical energy in joules, there being J joules in a 
calorie, then a specific heat Cp measured in calorie units will be equiva¬ 
lent to one of Jcp measured in jo\i!e units, and similarly for Cy. Thus 
equation ( 9 . 12 ) must be modified to 

Jcp = Jcy f R:m 
or Cp — Cy ----- R mj 
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(5) Let z be any function of any two independent variables, x, y, 
so that z = f {Xy y). Suppose that this relation can be rearranged to 
give X in terms of y and z, then provided that D^iyZ ^ o, the formulas 
for the partial derivatives of x are 

Dy\2X = D j.^yZ 
D,\^= 

Proof. We shall write Zy\x for and similarly for other 

derivatives. Thus we have to prove that .v^i* = — = 

i/- 3 'xlv> quantities Zx\v and Zy\x being known quantities and Zx\ y ^ o. 
(The condition Zx\y o has to be imposed to prevent the fractions 
having zero denominators. It can be shown that when z^iy o we 
can in fact always consider a: to be a function of independent variables 
y and z; but the proof is somewhat complicated.) 

Now the second relation x.\y= ilz^w follows immediately from the 
ordinary rule for derivatives, D;X = ijDjZ, since we are keeping the 
same quantity y constant in both cases. 

To prove the first relation A-y,- = — Zy^xi^zw "e consider the 
formula for change of variables from x, y to z, y. This is, for any 
quantity w, 

^V\Z -^^v\xyv\z 

sincethe derivative ofy with respect to^', is i (the suffix indicating 
that z is constant is irrelevant in 

This holds for any function w. 'Fake zv to be z, then zVy\. = Zy\. = o, 
since this means “the rate of change of z when z is constant”. Therefore 

O = Zx\yXy^. + Zylxt '‘C- 
■2'ar|v*''vU “ ~v\z- 

Since z^iy is not zero we can divide through by it obtaining the 
required formula a:„|. == — Zv\zl^ztv 

If Zy\x ^ o this formula can be expressed in a very symmetrical 
form, for then Zy\x = i/yz \x by the usual rule that DyZ = ijO-y, the 
same quantity x being kept constant in both cases. 'I'hus we have 
Xy\t = — ijy,\xZx\v multiplication through by y-st^xiv this 

becomes an identity 

XvU yz\x ^x\y ~ t 
PROBLEMS 

(1) If the bacterium of example (2) above is cylindrical, with fiat 
ends instead of hemispherical ones, what is the eflcct on M = KAjV 
of a small change in the length and radius? 

(2) Differentiate zv = tan (jc ^) + tan (a;—>-) partially with 
respect to x and y. [Change the variables to (x + jy) and (.r ~ y).'] 
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(3) Differentiate to = In (*"- + y^) - In (*= - partially with 
respect to x and y. 

9.6 Formal definition of differentiability 

In the discussion above \%'e have used such arguments as that if 
is a function of .v and y, and if Sa: and Sj are small then is very 
nearly equal to w,u8.v + or as we can write it here, + 

Wyby, where and w, are the partial derivatives of w with respect to 

a; and v respectively. 

We have also said that the smaller 8.v and 8y are, the more accurately 
this formula holds. Now it is rather unsatisfactory to use such a vague 
term as “very nearly equal”: it would be better if we could say more 
precisely what was meant. So, for the sake of logical accuracy, we shall 
examine more carefully the error 

E = (oJj-S.v + — Sw 

in the formula for a small change. (The reader who is interested in 
the practical rather than the logical aspect can omit this discussion.) 

E is the difference between the calculated change {tv^Bx + 
and the actual change hw in w. Roughly speaking what we want is 
this error to be ver\' small compared with the small changes S.v and Sy. 
L'nfortunatelv we cannot say that htv is always small in comparison 
with 8 .V and also in comparison with Sv. For S.v and Sv are independent 
quantities, and we can alwavs if we wish take S.v to be much smaller 
than 8v, or even zero. But if we put S.v = o the error E will be 

which will not necessarily be zero, and if not will be very 
large in comparison with the zero quantity S.v. Thus this first and 
simplest attempt to clarify our ideas does not work. 

We now make a second attempt, and say that we want E to be 
small compared with jS-vj + ISvj. 'I’his docs not run into the same 
objection, and we can put S.v - o without encountering difficulties. 
F<jr we then have 

E = TVyhy — Sze, i.e. f 'Sy ~ Wy — Szii/Sy. 

Since zc,„ the partial derivative, is by definition the limit of BwlSy as 
8v ' o -when .v is constant (S.v • ^ o), it follows that the smaller By 
becomes the smaller is the difference between zVy and Bwj’By, so that 
the smaller is EjBv. In other words, the smaller we take Sv to be the 
smaller the error E is in comparison with Sv. So far we seem to be on 
the right track. 

We can now go further and say that we want E to become small 
in comparison with |S.v| + jSyl when jS.vj T |Sy| becomes very small. 
Put more formallv, what wc would like to be true would be that 




ze^Sy) — 
T |Sy| 


Szi; 

--- = o 


• (9-13) 
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as |S.-c| - 4 - \ Sy\ o. For when (9.13) is true we shall be completely 

justified in using the approximate formula Szv tv^Bx + 

We can show that the property (9.13) does hold for various simple 

functions. Let us take some examples. 

Consider first the function w = x then Wj, = i, w^ = 1, and 

therefore E = Bx By — Sw. Now with our usual notation 


Sx = X 2 — ^ 1 , Sy = ja — >’i. 

Bw = zv 2 — 1 = (^'2 -b J 2) — (•'■'1 "byi) 

= (JC2 — ^1) + (>’2 —>■1) = Sjc + By. 

Thus E — o and so lim E/(|Sxj +18 y|) = lim 0 = 0. 

Secondly consider w = .v- +3'". Then = 2.v, = 2_y and 

E — 2.V, Bx + 23'i Sjv — Soj. Now 

Bw = ZU2 — zoi = {x^-\-yz )—(•’f +3'i") ~ *' 1 ") 

= {X2-\-X,)iX2-X,) + (y2+yi)(>’2-3'l) = (^2 +-Vi)8-V+(j«+3',)^>’- 

Therefore 

E = (.■v,-X2)S.v T (y,-y2)Sy = - (W 


and 

l^l = ISxl^+lSyl'^ < 1 S.v|2+2|S.v 1 ISyl+lSjj^' = (| Sx]+| Sy])’. 
Dividing by the positive number |8.v| + 18 y| we have 


1^1 


< \ ^x\ + |8yl 


I 8 x 1 + |S >’1 

and so as |8xl + |8yl -> o the left-hand side must also tend to o. 

Finally consider the function zv = xy, for which w, == y and zv^ ^ .v. 
Here E = y,8x + x,8y — 8tt;. But 


Bw = ZO2 — w^ = x^y^ — x,yi = (x, -f Sx)(y, By) — Xiy, 
= jViSx + x,8y T 8x Sy, 
and therefore E = Bx . By, Now 

(|Sx| + |5y|)2 = ISxl^ + 2 lSxl |8y| -f- ISy^-, 


and therefore 

\E\ = |8x| 18 y| <(|8xl + |8y|)2. 


Thus 


__ 1^1 __ 

18 x| + [Syl 


< 18 x| + |8y| 


and must tend to zero as |8x| T 18 y| tends to zero. 

In this way we can verify that equation (9.13) holds for all functions 
that one meets in practice. Accordingly we make the following 
definition. 
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Definition of differentiability of a function of two variables. If a 
function w of two variables is such that equation (9-t3) holds good as 
I ^ o for any particular values of x and y then w is said 

to be “c ifferentiable with respect to the two variables” for those values 
of X and y. 

In the same way if IF is a function of three variables Xy y and s, 
we shall say it is differentiable when 

— it ° 

15 .v| + |8y| + \ Bz\ 

and when that is so we can safely use the approximate formula SIF=s: 
Ij/^Sx -h W„Sy + lV~ 8 z. This definition of differentiability allows us 
to give formal and logically accurate proofs of our formulas for differen¬ 
tiation. For example, if IV is a differentiable function of .v and y, and 
x and y are differentiable functions of X and Y, then the formulas for 
change of variable 

Wx = IVy^xx + Wyyx 

Wy = IFpry + IV^yy 

(where IVx = Dxw^V, etc.) can be shown to be universally applicable. 
In principle all we have to do is to repeat the argument we have already 
given, but instead of saying “8IF is approximately equal to IF^S.v -j- 
IVyby" we can now use a more precise definition in terms of limits. 
But we shall not pursue the matter further here. The important 
question for practical purposes is “when is a function differentiable, 
so that we can apply the required formulas?” The answer is that in 
practice all continuously variable functions can be considered to be 
differentiable. 'Fhere are a few exceptions and qualifications, but they 
are almost always obvious. The function i/.v will not be differentiable 
when X -- o, since it then becomes infinite; the same applies to the 
function log v. The ftmction (.v- — y") '(v“ + V“) of the two variables 
X and V will not be differentiable when .v = y — o, since it then becomes 
0, 0, which is indeterminate. A function which varies discontinuously, 
such as the population of a country’, is not strictly speaking differentiable, 
although in practice it may be sufficiently nearly represented by a 
differentiable function. All these exceptions are fairly evident and will 
normally be avoided with sufficient care. In theory there are more 
subtle snags, but in practice these rarely if ever arise. 

One further comment mav be made. The definition of differentia- 
bility tells us that we can make the error in the formula 8 w = WjSa; -h 
rCy 5 y as small as we like in comparison with |S.v| + |8y| by making 
5.V and Sv sufficiently small. It does not tell us how’ great an error w’e 
can actually expect in any given example, at least not directly: but w’e 
.shall later find formulas which answer this question. 
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9.7 Constrained variation 

When a gas is free to vary in volume {V), pressure (P) and tempera¬ 
ture (P) we know that any two of these can be chosen as independent 
variables, since they are connected by the gas law PV tiRT. Any 
property zv of the gas, such as its internal energy, its specific heat, its 
thermal conductivity, or its entropy, can be expressed as a function 
of two of the variables, say the pressure P and volume V. We can 
therefore calculate the partial derivatives and Dy\pW oi to. 

Now imagine the gas contained in a rubber balloon. I his imposes 
a further constraint on the gas, since the greater the volume the greater 
the pressure exerted by the balloon. Thus there will now be a definite 
relationship between pressure and volume, and they can now no longer 
vary independently. (Actually the situation will be complicated by the 
fact that the properties of the balloon are affected by temperature, but 
to avoid complicating the argument we shall ignore this effect. We 
shall also suppose that the external atmospheric pressure is effectively 
constant, so that all changes in pressure are due to the stretching ot 
the balloon.) Then corresponding to each volume of the balloon there 
will now be a single value of the pressure, a single value of the tempera¬ 
ture, and a single value of the property w\ all quantities are now 
functionally connected. We can no longer speak of a partial derivative 
Dp\yto, the rate of change of w when V is kept constant; for if V is 
kept constant, P would also be constant, and Dp\yto is a contradiction 
in terms. We can now only calculate the ordinary or total derivative 

DpW. 

Another case in which such a constraint would occur would be that 
of a mass of gas (not necessarily within a balloon) whose volume and 
pressure were changing with time. We could imagine recording 
instruments plotting graphs of the volume V and pressure P against 
the time /. P and V must now be considered as functions of the single 
variable and any other property of the gas, such as the temperature 7 ', 
will also be a function of the single variable t, since it will be determined 
when t is known. Now suppose that we wish to estimate from these 
graphs how rapidly the temperature T is changing (with respect to 
the time). One way to do this would of course be to calculate from the 
gas law the values of T for each value of P and V and plot a graph ot 7 
against the time or if P and V could be expressed as mathematical 
functions of /, we could do this by substituting these functions itt the 
equation 7 ’= PVjnR, and thereby express T directly in terms ot t. 

However, we can often tackle the problem mt)re etficiently other¬ 
wise. We know that in general, whether there is an extra constraint 
or not, the gas law 7 * — PVfnR must apply. If the gas is unconstrained 
we have therefore that small changes hP and In the pressure and 
volume must correspond to a small change 6 T in temperature given by 

sr^(Dp,^.r) sp +{Dy^pT) sv 
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But since T - PF/z/i?, Dp\yT = VjnR and Dy^pT = PlnR, so that 

bTc^iVjnR) SP + (P/«P) BV. 

But written in this form the relation is simply a deduction from the 
gas law which must hold good however the pressure, volume, and 
temperature vary. It must therefore be equally true for the constrained 
system as well as the unconstrained system. Now divide through by 
8i; we obtain 

(Sr/S/) ^ {VjvR) (SP/8/) -h (P/'«P) (SF/S/). 

Take the limit as Bt tends to zero. 


D,T = (F/;/P) DtP + (P/«P) DtV. 

This therefore is the equation we require which enables us to calculate 
the rate of change of T when we know the rates of change of P and F. 
But we calculated the coefficients (F///P) and (P/wP) as the derivatives 
Dp\yT and DyipT for an unconstrained mass of gas: so we can write 
this relation in the form 


D,T = (Dp, yT) (D,P) + (DripT) (AF) 

« 

or alternatively in suffix notation 

Tf = Tp\yPt + Ty^pVt 

with the interpretation that Pp,r and Ty^p are partial derivatives 
calculated before the constraint is applied, and T,, P, and F, are the 
derivatives for the constrained system in which P and F have become 
functions of the single variable /. 

d'his is a general principle. Suppose that w is a function of several 
variables .v, v, z. . . . Then so long as they are all allowed to vary 
independently, 

8zc ^ y., .., 5.Y rcyu.r ... By Bz ^ . 

Now this equation must hold good however 8.v, By, and Bz vary. It 
must still be true if S.v, 8v and Bz are limited bv some relation, pro¬ 
vided that we calculate the values of ctvi , w’yix.r ... . ix.t/.. . • • • 
for the free and jinlimited system. 'I’hus if .v, v, z . . . are all con¬ 
strained to be functions of a single variable t, then 


BzvjBt ■>- ... Bx ... By'Bt-^u',,j,,y... Bz/Bt + - . - 

and in the limit as Bt — o 


... -V, + ... y, ^ .. ;x, + . . . (9.14) 

If we can safely omit the suffixes indicating the quantities held constant 
this equation takes the simple form 

w, ft^Y, 4 4- n\z, 4- • • • 
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This equation is now exactly of the same form as that for a change 
of variable given in Section 9.5, which is a useful aid to the memor>'. 
But the interpretation is somewhat different. In the change-of-variable 
equation all symbols represent partial derivatives, whereas here 
Wy, . . . represent the partial derivatives calculated for the uncon¬ 
strained system, and w,, Xf, . . . are total derivatives for the 

constrained system. In the dwjdt notation we would write 


dw 

Tt 


dw dx dw dy 
dx 8t dy ct 


+ 


cw cz 


bz ct 



for the equation for change of variable, and 


dw 

Tt 


dw dx dw dy , ^ 

dt dy dt dz dt 



for the constraint equation. 

These equations can of course be proved more rigorously by using 
the formal definition of differentiability given in the previous section. 
The method of proof will be essentially the same, but it will be expressed 
more precisely in terms of limits. We shall not concern ourselves with 
these complications here. 

'i'hese equations also enable us to solve the balloon problem. 
Suppose that w is any property of the gas which can be expressed in 
general in terms of its volume V and pressure P so that we know the 
partial derivatives w^,^lf and «;,'|/.(for the unconstrained system). We 
now suppose that because of the constraint imposed by the balloon 
the volume V becomes a known function of the pressure P\ we wish 
to calculate the derivative 7 Vp of w considered now as a function of P 
only. Our formula gives 

W/> = Wp\y Pp -|' Wy^p V p. 

But Pp = DpP = I, and Vp is supposed known. 'Fhus the answer is 

Wp ^ Wp\y -j- ^y\v^p‘ 


Suppose that the quantity we are interested in is the temperature T. 
It has already been shown that Tp\y— P/nR, Ty^p - Vj'nR, so that 
Tp -- VjuR i PVpjnR (V -\- PVp)(hR. Suppose further that the 
properties of the balloon are such that its volume K a/(^ — Py^ 
where a and fi are constants. Then by direct differentiation 

Vp = Vi^i^pi^ - p)p - 3 «/(^ - 

and therefore 


V -i- pvp - a/(^ - py + 3«p/(^ - py = « {fi + zpyifi ~ p)\ 

and 

Tp - « {fi -}- zPynR {fi - py. 
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This shows how rapidly the temperature changes in relation to the 
pressure: a small change 8P in P corresponds to a small change ST — 
TphP in T. 

PROBLEMS 

(1) Consider the bacterium of example (2), Section 9.5, consisting 
of a cylinder of length L and radius R with hemispherical caps on the 
ends. Supposing that the length L increases proportionately to the 
time, L = a?, whereas the radius increases only as the cube root of 
the time, R — where a and /S are constants. How rapidly do the 
area and volume increase? 

(2) A body thrown through space has cartesian co-ordinates x 

and y ; the distance r from the origin is given by r = + y^)' 

Find r^iy and r,,]*. If x and y vary according to the laws x = 2t, 
y = p — 5P, where t is the time, find the rate of change of r. 

(3) The intensity of illumination I of the floor of a room due to a 
lamp at height h is P/(a:^ + >'" + h-), where B is the brightness of the 
lamp and x and y are cartesian co-ordinates of the point P at which 
the illumination is measured, the co-ordinate axes being two per¬ 
pendicular lines on the floor intersecting at O directly under the lamp. 
If — 2 metres, x and y also being expressed in metres, find the partial 
derivatives with respect to x and y. 

A man is walking across this room looking for a lost coin. If his 
position at time t seconds is given by x = it — 10, y ~ it — 5 y find 
the rate at which the illumination of the floor under his feet is changing. 


A constraint may not be so complete as those we have considered 
above, in which every quantity is reduced to a function of a single 
variable t. If a system has originally three independent variables Xy 
y, and z, it is conceivable that a relation imposed on them might 
reduce them to two independent variables only. For example, con¬ 
sider gas contained in a vessel with an inlet tube. Normally we can 
vary the pressure, the volume and the mass of gas in the vessel inde¬ 
pendently. If, however, the vessel is a balloon then there will be a 
relatioi\ l'>elwcen the pressure and the volume, and only two variables 
such as the pressure and mass will he independently variable. The 
rule for this case is ver)' similar. If zv is a function of three variables 
.V, y, and z, and if .v, y, and z are constrained to be functions of two 
variables t and «, then when zv is considered as a function of t and u 
its partial derivatives are 

^ilu ^jri ^\i\x.syt\u “b 

^x \ v.r'^'wi ( ~b ^1/1 J.jJV'u! t d" I x. ( 

or if we can safely drop the extra suffixes, 
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Wt = w^t + w„yt + w.Zt 

+ Wyju H- o’zS-u 

Here w^, Wy, and zo^ are the three partial derivatives calculated 
before the constraint is applied. These formulas are of course of exactly 
the same form as those we have already had for constrained systems 
or change of variable without constraint, and the argument follows 
the same course. Either (or both) of the new variables t and u may 
be the same as one of the old variables x, y or z\ the formula will still 
hold. 


FURTHER PROBLEM 

(4) If the mass of gas in the balloon is variable, and the pressure 
and volume are connected by the relation V = a /(/3 — P)^, what are 
the partial derivatives Tp^n and Tn\p} 

9.8 General form of constraint 

Sometimes it may happen that two independent variables x and y 
have a constraint imposed on them which cannot be easily expressed 
in the form "x = a function of t, y = ^ function of For example, 
we might find that x^ xy y^ = o. If zv is any function of x and y 
we might want to know the derivative w* = after the constraint 
is applied. Now we know that 

= aJxlv + ^v\xyx 

where Z0j.\y and are* the partial derivatives in the freely varying 

case. We can therefore solve the problem provided that we can find 
jVj. = Dj.y for the constrained system. 

The technique to be applied to this type of problem is as follows. 
Let the constraint be expressed by a relation which we can write 
/ (*. >') = o. 

Now consider for the moment the situation where x and y can vary 
independently. / (x, j) will then be a given function of x and y, and 
a small change Sx in x and Sjy in y will produce a change S/ in the 
value of / given by 

¥-/*U ■ Sjc +/u|, . By 

where/,|y and /y|* are the partial derivatives calculated zviih x and y 
independently variable. We now reimpose the constraint/ = o. Since 
/ is now constant, Bf = o and therefore Bx + /v\xBy o, or on 
division by Bx 

fx\y +/v,x{ 8 :v/ 5 x)=iO. 

Take the limit as Bx o 


f x\ V "h f y\xyX — 
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\{ fy\x ^ o we can divide through by it, obtaining finally 

Vx = — fx\v!fy\x • • • (9-^5) 

(This can alternatively be deduced from the formula Xy\- = — Zy\JSx]v 
of Example 5, Section 9.5, where s is any function of x and y. Write 
y, -v, / in place of .v, 3', z respectively; the formula then becomes 
= —/xu//«u- Butjxi/ means the derivative of y with respect to 
.r when / is constant, i.e. the derivative 3'x under constraint.) 


EXAMPLES 

(1) If X and y are connected by the relation x- y^ = K find 
y^ = D^y. 

Write f{x,y) = x- +3’- — K: then this relation is / (.v, y) = o. 
Now when .v and y vary independently/xi „ = 2X and/yi, — 2y, and 
therefore in the constrained case 3’x = — 2.V/23’ = — xjy. 

(2) If .V andy arc connected by the relation + e*' = K find y^. 

Write /(.V, y) = -r — K. Then the constraint is/(.v, y) = o. 

Now/^,„ = e', /y,j, ^ e", and therefore y^ = — 

'riie same technique can be applied to more complicated cases with 
more variables and/or more constraints. A curious feature of this 
formula is that the partial derivatives fj^w fv\x need have no real 
siguificance in themselves, but may be only stepping stones in the 
process of finding the derivative v^.. If -v and v are in reality connected 
by the relation / (.v, v) o (as, for example, .v- -f-3'" — K - o) then 
in realitv thev cannot var\' independently. But in imagination we can 
lift tliis restriction for a time and find what will happen to/ when they 
ilo so. In the end the restriction is re-imposed and yv calculated for 
tile real and limited system. There is no harm in arguing in this way 
pro\ iiled that what is being done is clearly understood. But it seems 
necessary to make these remarks since the process is sometimes used 
without adequate explanation and can at first appear not a little baffling. 


FURTHER E X A M P L E S 

(3) 'I'he growth of a colony of bacteria obeys the law // 
where n is the number of bacteria per cubic centimetre at time t, N 
ilte initial number per cubic cm when / = o, and is a constant. 1 his 
law iiolds provided that the concentration « is sufficiently small for 
there to be no appreciable competition between bacteria or exhaustion 
of the food supply. 

Now suppose that we make an error S« in measuring ;/, in 
measuring N, and 5 / in measuring t. What will be the effect of these 
errors on the value of k calculated from the obsen-ed values of «, N, 
and / ? 
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There are several ways of solving this problem. One would be to 
solve the given equation for ky obtaining ^ = (In tj — In N)lt, and then 
to apply the formula for a small error. 

We can also derive the formula directly from the relation n — N€^*=o. 
Let us put n — ATe** =/ (n. AT, k, t). Then if for the moment n, N, 
k and t are imagined as independently variable we shall have 

hf = /„8w + yW^AT fk^k 

where the partial derivatives are 

/„ = I, fN=- h=- ft=- 

Now when we apply the constraint / = o we shall ensure that S/ = o, 
that is 

Bn — 8 N — Nte^^ Bk — Nke^* 8f = o 
or AT/c'-* Bk= Sn ~ BN — Nke^^ Bt 

, Bn BN kBt 

“ iv^ ” Nt ~r 

which is the formula we require to calculate Bk from Bn, BN, and Bt. 

(4) Given that x, y, and z are three variables connected by the 
relation x^ z"^ = r’ = constant, so that only two of the variables 

are independent, find and 

Write f{x, y, z) = x^ + y® + z^ — r*. Now imagine for the 
moment that x, y, and z are allowed to vary independently: then 

Bf Ca fBX ~\~fv\X.Z^y 
=af zxBx + zyBy + 2:2:82:. 

Rc-impose the restriction /=o, then 8/=o, i.e. x8A:+3/8y+^S2 '=eo. 
Solving this equation for 8s: we have 

82: =2 — xfz . 8 a: — yjz . By. 

Now Zg^y, is by definition the limit of BzjBx as 8 a: o with y con¬ 
stant, i.e. By = o. But on putting 8jy = o the above equation gives 
82: =as — xjz . Bx, i.e. Bz/Bx =; — xjz. By taking the limit, we obtain 
2r.,y = — xjz. Similarly 2:^,, == — yfsi- 

Alternative method. Use the formula for a constrained system to 
calculate fxw- 

y*i V “ y*i w.*^*i V “b/vi V "by* I V 

where fx\v.t = 2a;, ft\x.v = 22: are the derivatives cal¬ 

culated for the freely varying system. But with the constraint f = o 
we must have /x\v — o. Also Xx\y = i, and = o, since it is the 
rate of change of y when y is kept constant. Thus the equation for 
y«lv reduces to 

ZX 4-0 -\- ZZ . Zx\y=^ o, 

that is, Zx\y = — xjz. 

Q 
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RELATIONS INVOLVING RATES OF CHANGE 


lO.i The relation between position and velocity 

We have seen above that we can always find the instantaneous 
velocity of a moving body when we know an equation connecting its 

position (measured by a co-ordinate y) and the time f We wn^ 
I = or y„ or dyidt. The same applies if y is any variable quantity 

which is a function of t; we have a set of rules by which we can find 
the instantaneous rate of change ofy^ This we have called the derivative 

of r with respect to the time t. . 

Can we reverse this process? If we are given an expr^sion for a 
velocity, can we deduce one for the position? If we know how rapidly 
a quantity is changing, can we find out what the quantity itself is? 
This is an important question, since it is, for example, fairly easy to 
measure the position of a point with reasonable accuracy, but very 
difficult to measure its instantaneous velocity. If we are able to deduce 
a value of its velocity from theoretical considerations, it is much easi^ 
to check whether the theory agrees with observation if we can convert 
it into one giving the position of the point as a function of the time. 
Again, in a chemical reaction it may be easy to deduce how rapidly 
the concentration of one of the reacting substancp is increasing: bu 
we cannot very well check this experimentally until it has been expressed 
as a relation between the concentration itself and the time. 

Suppose that the quantity y is given by the law y 

instantaneous rate of change is v = yt-= D,y — zt. If 

are given instead the law for the rate of change = 2/, we can deduce 

that one possible expression for y would be t- But is ^ 

possible value of v? The answer is no: the law y = t-^ 1 would 
also give the rate o'f change ^ - zt, since the derivative of the constant 
I is zero. The lawy = r- + 15 ''ould also gi\^ the same rate of chimge 
v = v, = zt: and in general the lawy = r- + C where C is any coi^tant 
will give ^ = zt. Thus + I. + 15 and in general /- + C are 
all possible values of v which satisfy the law that the rate of change 
of V is zt. This is perhaps not surprising, for if we are given the rate 
of change of r at each time t we can only expect to be able to inter the 
amounts by 'which y changes from one time to another; we cannot 
expect to infer the absolute value of y. Thus the equation y, = zt has 
the solution y = /- + C, where C is a constant. We can find the value 
of C when we are given as additional information the co-ordinate y of 
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the point at any one instant. For example, if we know thaty = o when 
^ = o, on substitution in the equation y =■ t- C we see that o = 
o 4- C, i.e. C = o. In fact in this particular case C is simply the value 
of y when t = o. 

We can perhaps illustrate this from one of the less well-known 
adventures of the famous detective Herlock Soames. “Follow that 
car,” said Herlock, jumping into a taxi, “it is driven by Professor 
Moronami, the notorious diamond smuggler. But be careful! He 
sometimes carries clubs too. I advise you to go just as fast as he does: 
no quicker, no slower.” So off started Moronami, and off started Soames, 
100 yards behind. And when Moronami accelerated up to 50 miles 
per hour so did the taxi, and when he slowed down to 20 miles per 
hour, so did the taxi. And their speeds were always exactly the same. 
And what happened? Well, as the reader will have guessed, they 
always remained exactly 100 yards apart. Now Moronami was very 

Pcfilton 



Fig. lo.i—PosiCion-time curves for 3 points moving with equal velocity 

fascinated by this fact and (keeping his club handy) he tried to work 
it out by the theory of limits. “Suppose my velocity is thought he, 
“then that means that lim hy/Bt = and therefore there is a function 
/(e) such that . . But we regret to record that at this moment 
Moronami failed to negotiate a difficult corner and crashed into a lamp- 
post. And of course the taxi driver, obeying his instructions, followed 
suit and ran straight into another lamp-post, exactly 100 yards behind. 
And so they all were taken to hospital, and when they were recovering 
they passed the time by playing bridge with the Matron. And Soames 
still wondered how it was that Moronami and his partner always made 
five diamonds or five clubs, if not a slam. However, that is not our 
business: the moral as far as it concerns us is that two points whose 
velocities are always equal will move so as to remain a constant distance 
apart. Unfortunately, as no doubt Moronami discovered, this is by 
no means easy to prove formally. The reason is that if we deal with 
instantaneous velocities we have to cope with limits, and the definition 
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of a limit is quite a complicated matter when written out in full. So 
we shall dodge the difficulty and take the result as proved. The 
importance of this result is that if we are told, for example, that v — zf, 
and find that y = iMs a possible solution, then we know for certain 
that any solution differs from this by an arbitrary constant, and is 
therefore of the form + C. This includes every possible value of 
and nothing is overlooked. The same applies to any other formula 
for V. This is seen from Fig. lo.i, which shows several graphs for y 
with equal values of the velocity v. Any two of these graphs are 
separated by a constant distance in the vertical direction. 

10.2 Integration 

The process of finding a quantity y when its rate of change v = yt 
is known is called “integration”, and y is known as the indefinite 
integral of zj”. It is the opposite of differentiation, by which we find 

V from y‘. and y could also be called the “anti-derivative of v. The 

reason for the name “integration” is this. In differentiating a variable 
y we take a small change Sj' in y corresponding to a small change 8/ 
in the time /, and find die velocity v by the formula w = Sjy/St. More 
exactly the instantaneous velocity is the limit of this when Sz-^o; 
but this doesn’t matter very much, since when M is small enough 
hyjht will be inappreciably different from v. Conversely if v is known 
then we can find how much y changes during a sufficiently srnall 
interval of time Sf by using the formula and by adding 

together all these small changes we can find the whole or “integral 
change in y for any interval of time, however large (Latin integer == 
whole, complete). "This is a possible method of integration, but it is 
very laborious. 

A much better method of integrating a velocity or rate of change 

V is to look for an expression whose derivative is equal to v. This is 
a matter of inspired guessing: but there are certain rules of integration 
which arc verv helpful. T.’hus v ~ since we know that 

we deduce that y = + C. If — cos /, since sin t = cos t, we 
know that \ — sin / + C. If = i/^t since Dt In / — jjt, jy = In / + C. 
The constant C must never be forgotten; it is because of its presence 

that we call y the indefinite integral. 

^o\v \vc know that wc cun deduce the denvutive of from thut 
of t- by the rule that when a function is multiplied by K so also is its 
derivative. Thus since Z>,/= = zt we see that D^Kt- == zKt. Looked 
at the other way round this means that since t~ is an integral of 2#, 
2t‘ must be an'integral of 4 /, and Kt~ must be an integral of zKt. 
I'o get the general integral we must add a constant C, giving Kt- + C 
as tile general form of the indefinite integral of zKt. Similarly since 
/)( sin t = cos t we see that D, (AT sin t) K cos /. Interpreted 
in reverse this means that sin t is an integral of cos and K sin t 
is an integral of K cos t. The general integral of K cos t is therefore 
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K sin t H- C; e.g. the integral of ^ cos / is J sin t + C. In general 
we can say that if y is an integral of v, then Ky is an integral of Kv, 
and the general form of the integral is Ky + C. This enables us to 
integrate many expressions, for example powers of t. Since = 3/“, 
Dtt* = 4/®, and in general == (w + i)/", we see that = t'K 

= <3, and in general Z),(« + = r. The integral 

of is therefore 4 - C, the integral of is + C, and the 
integral of is + 1) + C. Notice that fortunately we do not 

need to worry about the constant of integration C before the final 
stage of the calculation. We could put in a constant earlier if 
we wanted, saying, for example, that the integral of 3?“ is + Cj, 
where Cj is some constant, and that therefore one-third of this, or 
^^3 _j_ is an integral of t-; the general integral of is accordingly 

j^3 _j_ C2, where Cj is another constant of integration which 

might conceivably creep in. But this can be written + (JC*! “r ^2), 
and the combination (^C, + C2) though complicated in appearance is 
in fact simply a constant which we can write as C. Thus the integral 
reduces to the form + C, which can be obtained by adding on the 
constant C at the end of the operation. 

The formula for the differentiation of a sum also gives us a useful 
rule for integration. We know that if w is the derivative of y, and w 
is the derivative of z, then v + w is the derivative of y -f- z. In reverse 
this means that if y is any integral of v and z is any integral of w, then 
j H- 2 is an integral of v -1 w, and the general integral is >’ -f a: + C. 
(Again we can conveniently ignore the constant C until we come to 
the last step.) The integral of a sum is the sum of the integrals plus an 
arbitrary constant C. 'I'lius since \t'^ is an integral of t, and is an 
integral of the integral of t -f t^ must be \t- 4 - C. Some 

expressions can be integrated by the use of both rules: the integral 
of 4^ -I will be 4 X it- 4 - 6 X 4- C = 2/^ + 2/^ -P C. The 
integral of 2 4- 3 cos t will be 2t 4- 3 sin t 4 - C. 

It is clear that we need a distinctive notation for an integral. One 
fairly obvious method of writing “the indefinite integral of v with 
respect to /” would be y ^ Di~^v; for just as sin“*je means “an angle 
whose sine is x” so D,~^v can be read as meaning “a function whose 
derivative is f”. '^Ihis notation is indeed sometimes used, thotigh as 
a rule the suffix I is dropped and the integral written as D~^v. But tlie 
usual notation for the integral of v is the original one, due to Leibnitz, 



'Lhis is read as 
Thus 


“integral v d <“ and means exactly the same as 
D-H = f tdt= i/2 .p c 
/>-1/2 = f i^dt = i/3 4- C 


/J,"‘ cos t ~ J cos t . dt = sin / 4* C 
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The rules we have given for multiplication by a constant K and 


addition 

can be written 




f Kvdt 

= K f V 

dt C 

or 

DrKKv) 

= KDr^ 

V C 

and 

y (tj + v))dt 

1! 

+ J wdt -h C 

or 

Di-\v -f w) 

= Dr^v 

+ Bc^w + C. 


The first rule has the simple interpretation that if P and Q are 
two moving points, and if the velocity Kv of Q is always exactly K 
times the velocity v of P, then the distance covered by Q is always 
exactly K times the distance covered by P. We leave the reader to 
think of an interpretation of the addition rule. 


PROBLEMS 

(1) Find / (3 + 2/)rf/, f {P + sin t) dt, / (2 + + cos 0 dL 

(2) The rate of growth r mm/day of a fungus is found to obey the 
law e; = 5 + where i is the age in days. Find the length of the 
fungus at age t. 

(3) The rate of growth of a colony of bacteria is found to obey 
the formula v = Je' (approximately: this is strictly speaking an 
idealization, since the gro\\Th is discontinuous). Find a formula for 
the total number y at time t. 

10.3 Standard integrals 

In order to be able to integrate rapidly and efficiently it is useful 
to liave a list of the integrals of the functions which occur most fre¬ 
quently, such as t‘\ In /, sin ^ e', etc. We give a list of standard integrals 
in 'Tables 10. i and 10.2. Some of the more recondite ones, such as the 
integrals of sec t and coth /, will mostly be useful for reference when 
needed. But the reader is strongly advised to learn the simpler ones off 
hv heart. 

Before we present the tables there are a few comments which can 
usefully be made. The correctness of all the integrals can be checked 
by ditferentiation. The integral of P is in general 1), and this 

formula holds whether n is integral or fractional. But when n ~ — i 
the formula fails, since n — i becomes zero and i/(« 4' i) infinite. 
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We know, however, that if t is positive D, In « = and therefore 
= In i + C. If f is negative this fails since a negative number 

has no logarithm. But D, In (-0 = and therefore if t is negative 

J = In (— t) + C. 

In practice too when we have to integrate a trigonometric function 
it will more often be something like sin 2t or cos 3? than the simple 
functions sin t and cos t. Accordingly the general integrals of sin at 
and cos at for any constant a are included in Table 10.2, although these 
can be quite easily deduced from the integrals for sin t and cos t by 
the method of change of variable explained in the next sectionMany 
of the other integrals can also be obtained either by this method or by 
“integration by parts” (Section 10.5). 


Table xo.i—Standard indefinite integrals 
(I) FUNDAMENTAL INTEGRALS 


Function 

V 

Indefinite integral 

y = Dt~^V = j V dt 

0 

C 

a 

at + C 

r (n 7^= — 0 

+ i) + c 

ijt (t > 0) 

In / 4- C 

i/t (t < 0) 

In (- 0 + C* 

1/(1 

tan”^ t C 

i/V (i + 

sinh“‘ t C 

- 1) 

cosh“* 1 C (? > i) 


or cosh”^ ( — i) -f- C U < — i) 

i/V(i 

sin"* t C 

e* 

+ C 

sin t 

— cos / 4- C 

cos t 

sin < -b C 

tan t 

— In cos t + C (cos i > 0) 


or — In (— cos /) C (cos t < 0) 

cot t 

In sin t C (sin / > 0) 


or In (— sin t) i- C (sin ? < 0) 
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Table xo.2—Standard indefinite integrals 
(II) OTHER USEFUL INTEGRALS 


J dt 

J \u t . dt 
J log t . dt 
J cos at . dt 
j cosh at . dt 
J sin at . dt 
J sinh at. dt 
J tan at. dt 

J tanh at . dt 
J cot at . dt 


— iln/ — t -\- C 

— / log i — Mt + C 
= a~^ sin at C 

— a~^ sinh at + C 
= — a”^ cos at C 
= a~^ cosh at C 

— — a~^ In cos at + C 

or — a~^ In (— cos at) + C 
= a~^ In cosh at C 


= a~^ In sin at C 
or a-^ In (— sin at) + C 
J coth at . dt = a~^ In sinh at C 

or In (— sinh at) + C 
J (sec dt — tan a/ + C 

J (sech aty dt = tanh at C 

J (cosec aty dt — — a~^ cot at C 

J (cosech aty~ dt= — coth at + C 

J sec at . dt = 


(cos at > o) 
(cos at < o) 

(sin at > o) 
(sin at < o) 
(sinh at > o) 
(sinh at < o) 


or 


or 


j sech at . dt 
j cosec at . dt 


or 


or 


j cosech at . dt 


or 


tanh”* sin at + C 

In (sec + tan at) + C J ^ 

a~^ In tan {^at 4- i^r) + C j' 
n- In (- sec at - tan at) + C I 

In tan (— tat — ^tt) + C ) ^ ' 

tan”* sinh at C 

— a”* tanh”* cos at C 

«-> In (cosec at - cot at) + C I ^ 

fl”* In tan t at C I ^ ' 

In (cot nt - cosec at) + C | ^ 

fl”* In tan (— \ at) C j ^ ' 

«”* In tanh \at + C {at > o) 

fl”* In tanh (— ^at) + C {at C. o) 
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Table xo.z—Standard indefimte integrals {contd.) 


1 


dt 


a b cos t 


tan 


(y:-49 + ^ 


or 


Vh^—b^ 

/== tanh-^( l\—- tan —^ 

Vi’*— a"- b+a 2 J 

{hr > a~ and cos t > — alb') 


+ C 


or 


coth 


4 -) 


+ c 


J sin”^ at . dt 

j sinh“' at . dt 

J COS'"' at . dt 
J cosh*"' at . dt 
J tan”' at . dt 
J tanh"' at . dt 


1 



‘ dt 


dt 


+ 


dt 


— a^ 


dt 

a^ 

— t^ 


{b^ > a^ and cos t <. — ajb) 
= t sin"' at + (2”' V (t ~ ^ 

= t sinh"' at — a“' (i + aH^) + C 

= t COS"' at — a”* \/ (i — aH~) + C 

= t cosh"' at — a"' \/ {a^t“ — i) + C 

= t tan"' at — In (i + aH-) + C 

= t tanh"' at + \ a"' In (i — a-t^) + C 

= a'**lb In a + C 


dt 


i 
i 

1 

I 

1 Va^~t^ 

f Vt^~Ta-^ dt 

f V dt 

f dt 


Vi^ + a‘ 
_dt 

Vt^ — a 

dt 


= fl"' tan"' {tja) + C 
= — a"' coth"' {tja) + C 
= a"' tanh"' (jfa) + C 
= sinh"' {tja) + C 

= cosh"' {tja) d- C 


= sin"' {tja) + C 


(t^ > a^) 

(/2 < a ^) 

{a > o) 
(a > o) 


{a > o) 


= ^a- sinh"' (t/a) \t Vt"^ a^ 


— — cosh"* {tja) 4- 


= sin"' {tja) + \t Va^ — t^ 


{a > o) 
{a > o) 
{a > o) 


These standard integrals can be used in conjunction with the rules 
for addition and multiplication by a constant to integrate a great variety 
of expressions. 
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EXAMPLES 

(1) The velocity of a moving body is given by = 2 sin zt + 

3 cos zt. Find the relation between its position y and the time t. 

V dt = z J sin zt . dt -)r Z J 

= — 2 (I cos zt) + 3 (i sin zt) + C 
= — cos 2^ + t sin zt C 

(2) The rate of growth of a chicken embryo over a certain range 
is given by the formula w, = Kt^ ^. Find the weight w as a function 
of the time t. 

w=^ f w,dt = K J « dt 

= {Kl 2 6 ) 

(H. A. Murray, Jour. Gen. Physiol., 9 (1925), 48, gives the formula 
w = •668i^'®, corresponding to AT = 2*405, C = o.) 

(3) Integrate zj = (i + tyjt with respect to t. 

By expansion z) = + 2 + ^ whence 

J V dt — \n t zt -{• C. 

PROBLEMS 

(1) Verify the integrals of Table 10.i by showing that v is the 
derivative of y in each case. 

Integrate the following expressions with respect to 

(2) I + i 

(3) zt + 

(4) sin t -h sin zt sin 3L 

(5) (I + 

(6) (I - r-)/(i -r 

(7) 9lv{t^ -i- 4) - 4/v(^- “ 9)- 

(8) sin t cos t + tan /. 

(9) sinh 3/ + cosh 3/. 

(10) sinh 3^ — cosh 3/. 

(11) I + / + /" + P- 

(12) (I +/‘)/(i +0- 

10.4 Change of variable 

In Chapter 8 we found five rules which help us in differentiation. 
These rules were those for differentiating a sum, y -h ar, a product by 
a constant, A'y, a general product yz, a quotient yfz, and a function 
of a function. Any expression can be differentiated by splitting it up 
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into simpler expressions connected by additions, multiplications, etc., 
and by using the appropriate rules. It is reasonable to suppose that a 
similar process can be applied to integration, and indeed we have 
already shown how to use the rules for a sum and for a product by a 
constant. The other rules can also be applied; in this section we shall 
study the function-of-a-function formula and in the next section the 
product formula. (The quotient _y/5r can be looked upon as the product 
of y and i/^r, and does not giv^e an essentially new result.) Unfortun¬ 
ately we shall see that although these formulas can be extremely helpful 
they no longer give an infallible and universally applicable method of 
obtaining the answer we require. 

Suppose then that we are given the rate of change v = yt of a variable 
y with respect to a variable t, and that we wish to find y itself. Then 

y — J yt dt = J V dt', that is the definition of an integral. Now in 

this definition there is no need for t to represent the time, although 
it will often do so. We can take any other suitable variable x, and if 

we know the derivative yx we can find y by integration; y ~ f yx^^- 

This second integral might be easier to work out than the first one: 
the problem remains to find j,. But this is given by the usual function- 
of-a-function rule (8.19); yx = yi^x = so that 

j = Jyxdx = J vtxdx . (10.i) 


If instead of the notation we write the derivative in the classical 
form dtjdx then equation (10.i) can be written 



J V dt J 



(10.2) 


This is the formula for the change of variable from / to x in an integral: 
writing it in this way has the advantage of suggesting the correct 
answer, since if we could treat the expression dx as if it was a number. 


dt 


and cancel it by the ordinary rule for fractions, we would have dx = 


dt. Strictly speaking that is not logical, since has no independent 

existence and only forms part of expressions like and J ii dx\ but 


still it is a useful aid to memory. 

Naturally the constant C of integration must not be forgotten; but 
it can be left to the last step, as with the rules for addition and multi¬ 
plication by a constant. 


As a practical example consider y = 


J e*" t dt: V = e^*t. 


Now we 


already know how to integrate an expression like e', hut not one like 
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e*". It is therefore natural to change the variable from t to x = 
so that e*" becomes e*, and so try to simplify the integral. We have 
now to express it in terms of x. An inversion of the equation x = 
gives t = ± Vx, and e*" = so that v the integrand (quantity to 

be integrated) becomes ± e^-s/x. Also ^ = i with the 4 - 

sign when t = + ^/x, and the — when t = — By our rule 

fvdt = fv'^ldx 

= / (±e=‘^/x){±■ll2^/x)dx 

= J \e^ dx 

= + C 

- + C 

since x = t-. Thus the integration is completed. 

Again suppose that a quantity y is known to have the rate of change 
^ q. i). Now our table of standard integrals allows us to 

integrate ift, but not i/(/ + i) with respect to t. Thus in order to 
find V it is natural to write .t = f so that z> = i/.v. Also since 
? — .V — I, we see that dtjdx = t^= j. Therefore by (10.2), 

y — J V dt 

= J .V-* dx 

— In (:r .v) + C 
= ln(±iii) 4 -C 

where we take the + signs if / + i > 0, and the — signs if / + i < o. 

In the same way f rf/ = In (dr i rb <*) + C. 

Sometimes it may be necessary to make two or more substitutions 

to reduce the integral to a standard form. Consider J e‘(i e*)~^ dt 

— J V dt (say). The first substitution it is natural to make is to put 

= .V, or t = In .v, so that dtjdx = i/.v. Now v — e* (i + e*)“® — 
a: (i + v)“2 and so 

f Vdt = f V dx 

^ ^ .V (i -b .v)“^ x~^ dx 

- - y (r -b x)-- dx 
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It is now natural to try i + -v = ~> giving x = z — i, dxjdz =■ i. Then 


y = J {i x)~^ {dxjdz) dz 

= J z-^dz 

= - 2:-i + C 

This completes the integration process, but the answer has to be 
translated back into a function of t. Since 2 = (i + x) and x e* 
we have 

;y = — Z-^ + C 

= - (i + C 

= - (I + C* 


It is clear that this method of change of variable is a very powerful 
one: many of the integrals of Table 10.2 are obtained in that way. 

Thus to get J cos at . dt put at = z, t = xja, dtjdx = ija. We have 
then 



cos at 


, dt J cos at . {dtjdx) . dx 
= J cos X . a~'^ . dx 


= a~^ sin x + C 
= sin at + C. 


There is a particular form of integral, which is of frequent occur¬ 
rence, in which some of the work can be short-circuited. This is when 
the integrand v is an obvious product of one variable (say it) by the 
derivative of another (say dxjdt), 

V = u . dxjdt 

in which case it is usually helpful to change the variable from t to x, for 


J V dt — J u . {dxjdt) dt 

= j udx [by (10.2)] . . (10.3) 

Thus consider y — f tan / . dt. We know that tan t = sin //cos /, 
and sin t is the derivative of ar = — cos /. Thus 


= J (cos /)”' {dxjdt) . dt 
= J (cos /)-* dx 

= y dx 

= In ( i *) + C 

= In (db cos /) + C. 
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Consider also the case p = c* sin e*. Here we know that e* is the 
derivative of = e*, so that v — sin e* . dxjdtf and 


J V dt = J sin (dxidt) dt 
= J sin c* dx 

= J s\Ti X dx 

=■ — cos :c + C 
= — cos e* + C. 


In the same way if we want to integrate v ~ it is natural to notice 
that e* is the derivative of = e*^ so that 


J V dt J e** {dxjdt) dt 

= J dx = C = + C. 

The reader is recommended to practise the method of change of 
variable, as it is a most useful method of integration. 


PROBLEMS 

Integrate the following expressions with respect to t: 

(1) e^^lVt 

(2) cos {t + 3) 

( 3 ) tlV{i~t^) ^ 

(4) t sm {2t^ + 3) 

(5) tan t . In cos t 

(6) sin t . (cos 

(7) . In (i + e*) 

( 8 ) V{ 2 t+ 3 ) 

(9) cos //(i + sin f)® 

(10) log(4i + 2) 

(11) sec using x — sin t 

(12) 

(13) e7(i+e‘) 

(14) i-'Vi^+^rxt) 

(15) sin t . cos t 

(16) t^i H-i*)-' 

lo.s Integration of a product (integration by parts) 

We have seen above that if the quantity v to be integrated can be 
expressed as a product u . dxjdt then 
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and this second form of the integral may be easier to deal with than 

the first. • r 1 

Also in this case 2; = « . dxjdt there is an alternative formula 

which may lead to a simpler integral. This depends on the formula for 
differentiation of a product 

d(ux)/dt = u . dxjdt + dujdt . x 

from which we have 

V = u . dxjdt = d{ux)jdt — dujdt . a; 

Now the integral we want is accordingly 



! 


du 
1- X 

dt 


dt 


But integration is the reverse of differentiation, so that 

J ^dt^ux-\-C, and 

y UX -\- C — I ^7 ^ 


But the constant C is not really needed in this formula, since the integral 
{ — X . dt will have a constant of integration to be added on, and 

J dt 

the sum of two arbitrary constants is equivalent to one. Thus finally 


we can state 







. (10.4) 


As an example, consider the integration of t cos t. We know that 
cos t is the derivative of sin t, so v can be written as u . dxjdt, where 
u = / and x = sin t. Therefore by (10.4), 


J V dt = J i cos t . dt 


■=■ t sin t 




dt 

dt 


sin t dt 


= t sin t — J sin t . dt 
= t sin t + cos t C, 
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Similarly when integrating te^ we observe that c' is the derivative of 
X = so that 


\te‘dt = tx- j %xdt 


— te* — J e' dt 

— 


For the ready use of formula (10.4) it is helpful to cast it into slightly 
different form. Let us put dxjdt = w, so that v = uw. Then is the 
integral of w, and the equation can be rewritten as 



j HKJ dt = u 



( 10 - 5 ) 


or in words the integral of a product {uw) equals the first (k) times the 
integral of the second (kj) minus the integral of [the derivative of the first 
times the integral of the second]. Here we can take for w any expression 
or function whose integral is known, and then u = vltv, to make v=uw. 


I du ,, 
ux — \ T- X dt 


EXAMPLES 

(1) Integrate = (f + i) sin t. Put u = t i, tv = sin /, so that 
X = J w dt = — cos t. (The constant C to be added is left until the 

final step.) 

fvdt- .... 

— ^ i) cos ^ + y cos / dt 

~ — (t 1) cos / + sin / + C 

(2) Sometimes it may be necessary to repeat the operation. Con¬ 
sider the case v = t~e*; put u = i\ to = e\ then x = f w dt = eK 

Thus 


J t) dt 


H.v — \ ^ X dt 
dt 

te* dt 


1 

= _ 2 y 


But we have already shown by “integration by parts” that J te^ dt — 
fe* _ 4- const., so that 


J t~e* dt = t-e^ — zte^ + 2e‘ -|- C 

(3) There is no reason why we should not write v v . i, so that 
u I, and x ^ f w dt = t. In this case formula (10.4) gives 

v = f vdt = tv — tdt . . (10,6) 
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For example let = In so that dvjdt = ifty then 

J \n t dt = tin t — J 1 dt 
= ilnf — t C 

Alternatively put v — sin”^ ty then 

J sin-^ t dt= t sin-1 ^ j y/ 

This second integral is dealt with by the method of change of variable 

Let I — =■ Zy then ^ so that 

dt 




dz 


whence 


= — \ J dz 

= _ i/j . + C 

= - (i - ef + c 

f sin -1 I — f sin -1 / ± (i — + C. 


PROBLEMS 

Find the following integrals y — f ^ dt where 

(1) V = t cos t 

(2) V t In t 

(3) V = 

(4) V sin t 

(5) V = V(i + 

(6) In the integral f sin-i t dt = t sin-i Z ± (i — Z-)* + C when 
must we take the plus sign, and when the minus? 

10.6 Failure of methods of integration 

Certain integrals cannot be coaxed into a standard form by any 
of the rules we have given. No amount of change of variable or 

integration by parts will give the values of y* e-*' dty J In sin t . dt or 

J y/(^i 4- /a) dt. Nevertheless we can imagine a point moving in such 

a way that its velocity v at any instant is given by the formula v ~ e-^\ 
or by «; = In sin /, or ^’ = \/{i + t^). The failure does not mean that 


H 
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there is no integral like >- = f dU it merely means that we cannot 

express this in terms of the familiar functions formed by powers, logs, 
exponentials, trigonometric and hyperbolic functions. We have to 

regard J e“‘" dt as an entirely new function of t, one which we have 

not met before; and by choosing suitable recalcitrant integrals we can 
get a wide variety of such functions, some of them of considerable 
practical importance. Later on we shall find ways of calculating their 
values numerically, in series, or in other suitable forms. The values of 
those which occur most frequently have been worked out and are 
given in tables [see A. Fletcher, J. C. P. Miller and L. Rosenhead, An 
index of mathematical tables^ t 94 ^ (London: Scientific Computing 
Service Ltd.)]. 


10.7 Differential equations 

Integration is the process of finding the value of a variable quantity 
V when its rate of change dyjdt — yt is a known function of the time t. 
But it often happens that yt is expressed as a function of y rather than 
of t. The rate of cooling of a hot body will depend in the first place 
on its own temperature y and on the temperature Y (say) of its sur¬ 
roundings. In fact, if the difference of temperature y — T is not too 
greats the rate of cooling will be proportional to this difference, so that 

we can put , . 

y,= -K(y-Y) . . . ( 10 - 7 ) 


wiiere AT is a positive constant. 

Again if w’e have a population of bacteria w’e shall e.xpect its rate 
of increase to be proportional to the number y of bacteria already 

present , 

y.^Ky . . ‘ • (lO-S) 


Of course, strictly speaking w’e cannot define an instantaneous 
rate of increase of a population, since it grows discontinuously. But if 
the numbers are large no harm will be done by treating it as if the 
number r in the population could change continuously. 

Equation (10.8) will hold for an isolated population of bacteria 
where the density is so small that effects of overcrowding, exhaustion 
of food supply, and poisoning by metabolic products can be neglected. 
We might take a more complicated situation in which the population 
grows and is also increased by an immigration from outside. The rate 
of growth by cell division will be Ky. We suppose, for illustration, 
that the rate'of immigration is a linear function A - 4 - Bt of the time t. 
('I’his will include a constant rate of immigration if w'e put B = o.) 
'ri\e total rate of growth will therefore be 


V, = K \ + A Bt 


(10.9) 
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The equations (10.7), (10.8), and (10.9) express relations between 
the rate of change yt = dyjdt of a variable y, the variable y itself, and 
the time t. They are known as first order differential equations. From 
them we would like to be able to find what y itself is, expressed as a 
function of the time: this would be a solution or “integral” of the 
differential equation. 

If the differential equation is of the form "yt = a function v of f' 
then we can find y by direct integration. If it is of any other form 
then some manipulation will be needed to obtain the solution. \\ e shall 
not discuss here all the possible devices, which can be found in text¬ 
books of differential equations (e.g. H. T. H. Piaggio, An elementary 
treatise on differential equations and their application, 1946, G. Bell; or 
E. L. Ince, Integration of ordinary differential equations, 1949, Oliver &: 
Boyd); but there are some simple and commonly occurring types of 
differential equation which deserve mention. 

10.8 Some equations of growth of populations and similar processes 

The next simplest type after the straightforward integral is the 
differential equation of the form 

y'l = a function oi y only. 

Examples of this are equation (10.7), ^ {y — ^). ^nd equation 

(10.8), yt — Ky. The correct procedure here is to reverse the natural 
line of attack, and instead of trying to find y in terms of t, to find t 

in terms of y. By the definition of an integral, t — j t^dy\ but ty~ 

and by hypothesis we already know this as a function of y. 'Thus con¬ 
sidering first equation (10.8), which is simpler than (10.7), we have 

yt = Ky, so that ty = ijKy. Thus t = J t^dy == (In y)iK + C. 

[y is by supposition positive, so we need not consider the integral 
t = (In [— y])IK 4 - C.] We can now solve this equation for_y in terms 
of t: 

^ - C = (In y)IK 
K{t — C) = In y 

y gK {t-O 

If we write e~^^ = c this can alternatively be written 

y ^ ce^i .... (10.10) 

which is therefore the law of growth of a population in the absence 
of competition or other limiting factor and with abundam food. 'I'lie 
solution (10.10) is shown graphically in Fig. 10.2 overleaf. 

We can solve (10.7),>*1 = — K (y — Y), in a similar way. We have 
/y = — i}K(y — Y) and therefore 
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Fig. 10.2_Curves illustrating the solutions of differential equations 

(i) exponential law of free growth, .v == ceKt 

(ii) logistic law of limited growth = 3L (i -f tanh [AL (t—C)/AJj 
(iii) Newton's law of cooling, y ~ Y + ce-A'f 


To find this integral put so thaty=ir + y, and dyldz=i. 


t 



I 

Z~(3nry) 

dz 






= — (In z')\K + C 
- - In {y - Y) 


'I'his is the solution: to get it into the most useful form y must be 
expressed in terms of t. We have 

f - C - K-^ In - Y) 

K{C- O-ln {y- Y) 

y _ y^e^'(C-0 

where c = 

V = V + ce-^^ .... (lo.ii) 

This is therefore the law of cooling of a body at temperature y when 
the surroundings are at a fixed temperature 1 . 

Equation (10.9) can also be solved in this way provided that we make 
a simple change of variable. Let us put A_v -t- Bt = z, then 

Zt = A'V( + B 

= K (Kv 4- A Bt) Y B 
= K{z + A) + B 
- Kz + (A'/l -f B) 

- K[z -r A Y B'K] 
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Now make a further change of variable by putting xr + ^ 4- BjK = 
w = Ky Bt A 4- BjK. Then 

Wf = Zi — K w 

so that = ij^t “ and on integration with respect to w 

t — J dw = \n zv -j- C 

= K-^ In [Ky Bt + A + B;K] + C 

i.e. In [Ky Bt A B/K] = K (t - C) 

Ky -\- Bt A BjK = 

where c = e~^^y 

Ky = — Bt — A — BjK 

y = [ce^^ — Bt ~ A — BIK]IK . . (10.12) 

This accordingly gives the law of increase of population with a 
steadily rising rate of immigration. 

After solving a differential equation it is useful to check the correct¬ 
ness of the solution. From the mathematical standpoint this can be 
done by differentiation and substitution in the original equation. Thus 
from (10.10), y — we have = cKe^* = Ky, which agrees with 

the differential equation (10.8). We can also often see whether the 
answer agrees with common sense. 'I'hus we know that with K 
positive, increases slowly at first, but rapidly gains speed and finally 
increases very rapidly indeed. This is just what we would expect of 
a population of bacteria. Hut we also see that when the period of 
extremely rapid multiplication has set in the population must soon 
approach a condition in which limiting factors become important, and 
the original equation will fail to hold. The equation of cooling j = T 4- 
ce-^* is also reasonable. approaches zero very rapidly at first, and 

soon becomes inappreciably different from zero. After that it decreases 
extremely slowly. 'I'hus a cooling body will fairly soon attain the same 
temperature as its surroundings for all practical purposes—how soon 
depends on the constant K: the smaller K is, the slower the cooling. 
But in theory according to this equation absolutely exact equality will 
never occur in any finite time, since is never exactly zero. Kquation 
(10.12) has a more surprising form, since one would not expect negative 
terms —Bt -- A — BjK to occur. But when t is large these terms will 
be small compared with so that after a time the population will 

increase exponentially, with the natural growth overwhelming anv 
effect due to immigration, until limiting factors inter\'enc. 

We notice too that in each of the solutions there is an arbitrary 
constant C. 'I'hus for a growing population of bacteria where 

c = apj C is arbitrary; or, if wx* prefer, we can say that c is an 

arbitrary positive number. We can fix the value of this constant by 
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specifying the value of y for one particular value of t; e.g. in the 
equation 3- = ce^‘, when t == o, y = c, so that c can be interpreted 
here as the number of bacteria at the start of the experiment. In general 
the solution of every differential equation has an arbitrary constant 
contained in it. We shall not give a formal proof of this, or state the 
exact conditions under which it is true: but it agrees in a general way 
with common sense, since the differential equation only gives the rate 
of change, and one would expect different initial conditions to give 

different solutions. , , . t. 

Another important equation soluble by the method given above is 

y = iCy (i —ylL)- This represents the free gro^vth of a population 

when limiting factors are taken into account. For when y is small this 

is approximately the same as the equation y, = Ky which we have 

already considered, as i-y/L=.i; but when the population y 

approaches L the growth slow's down again as (i ~ yjL) approaches 

zero. From this ty = i/ivy (i —y/L) and 

t = j t,dy = j 

One way of integrating this is to “complete the square” as for the 
solution of a quadratic equation: 


Ky(i — yjL) = — Ky-jL + Ky 

= - KL-^ (y= - Ly) 

= - XL-' [O' - iL'f - 
J (y - \Lf - }L^ 


1 = - X-'L 


Now change the variable of integration to » = (>’ — iL); since 
^ ~ y. = I and so by use of Table 10,2 

= tanh"^ ^ 

- [2L-' (y - ^L)] + C 


+ C 


= 2KL~- tanh”* 

= 2KL~- tanh”^ 

/ — C = 2KL-~ tanh“^ 

(/ - C) = tanh-^ [ay/L - i] 

2\iL — 1 = tanh {t — C)] 

V = (i + tanh {t — C)]) 


zy/L — I 
[zy/L - i] 


(lo.i;;) 


This is the “Verhulst-Pearl logistic law of growth” and under good 
experimental conditions it is fairly closely obeyed. Since tanh w 
approaches -1 as » — - oo and i as « -> + co this means that the 
population increases ver\' slowly at first, then has a period of rapid 
growth, and finally slows down again, tending asymptotically to the 
limiting population L (see Section 6.14 and Fig. 10.2). 
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10.9 Methods of solving differential equations 

As with the problem of integration there is no golden rule which 
will solve every differential equation without fail. But there are a 
number of standard types which can be solved, and a certain amount 
of manipulation, coupled with judiciously chosen changes of vyiable, 
will often reduce an equation to one of these types. Some of these 
types are rather rare, and are given here mainly for reference if required. 

{A) Variables separable 

If the equation can be written in the form y’t = a function of i 
divided by a function of y=mig(y) (say), then it is solved thus. 
Multiply through by g(y) giving g{y)y, and integrate both sides 

with respect to t : fg(y)y,dt = / M dt. From our rule for change 
of variable in an integral we know that J g{y)yt dt ~ J g{y) d \, so 
that the equation becomes 

/ ^(>') <^y = //(O 

When we have (if possible) performed these integrations we have a 
relation between y and t, which is the required solution. 


EXAMPLE 

^y{t = i-’/y-*. Multiply both sides by y', and integrate 

J y"* yt dt = J y“' dy = J dt 

That is, 

In (y) + C, = In (i /) + Co 

where and Cj are constants of integration. This means that 

In (± y) - in (:h 0 = C*! - C-o = C (say) 

In (±y/0 = C 

yjt = ±e^ ^ c (say) 
y ~ Ct 

where c is an arbitrary constant. 


(/i) Homogeneous equation 

A first-order differential equation is said to be “homogeneous” if it 
can be expressed in the formy, — a function of (y/0 ~ /(y/0 say. In 
this case the change of variable from y to s = yjt will give an equation 
in which the variables can be separated. For Zt ==■ {tyt — y)jt- = 
^^ty^ — tz)lt'^ — (y, — z)lt, and y, is a function of z only. 
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EXAMPLES 

( 2 ) The equation yt = yjt which we have already considered above 
is also a homogeneous equation. Putting yjt = ar it becomes y* — sr, 

— z)lt = o, so that yjt = ar = constant. 

(3) The equation + zyty, -y^-=o is homogeneous, for on 

dividing by it becomes i d-zCv/Oy,—(y/0^=o, or 

=.(z^-i)f2z. Therefore z, == {yt - z)lt = - f i)l2zt On 

bringing all terms containing z to the left, and t to the right, we have 

— 2ZZtl{z^ + 1 ) == 


Integrate both sides with respect to t: 


f 2 ZZt dt _ 

[ 2 Z dz f 

] z^ -\-l 

J + 1 J 


dt 


The right-hand side is In (± 0 + where Cj is the constant of 
integration. The middle integral is evaluated by the substitution 
z^ 1 = w, since w, = 2 z, and so 


2Sr dz 
z^ H- I 


-j 

t dw _ 

w 




loz dz 


w 


= — In (rh w) + O2 


Thus the equation becomes 

— ln(±w) -t-C2 = ln(di^) -J-Oi 

This is effectively the solution of the equation. It can be reduced 
to a form giving y explicitly in terms of i in the following way. We first 
rearrange the solution to give 

In (i 0 “b in (i w) = C 2 

In (± /w) = Ca — Cj 

/ttj = 4 - _ c (say). 


i.e. 

i.e. 


But w = -f- I = (y/ 0 “ + J = + I, so that 

t {y-jt^ -h i) = c 
or on multiplying through by t, 

^2 _j_ ^2 „ 

or y = i 

ivbere r is an arbitrary constant. 
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(C) Equation reducible to homogeneous form 

If the equation is expressible in the form y^ = 2. function of 

Hv ^ K 

. where H, AT, A, /, k are constants, then the change of 
hy -\-jt -\-k y j> > 6 

variables to F = Hy Jt K and T ~ hy -j- jt + k will generally 
reduce it to homogeneous form. For Yt = Hyt + J, F, = /ijy, -}- j, so 
that Yt = YffTt = {Hyt J)l{hyt +/) = a known function of yt = 
a known function of YjT. But if Hj = hj this method fails: the change 
of variable from y to T = hy -\-jt + k will then be found to solve the 
equation. (The methods of integration described in Chapter 15 may 
sometimes be required to obtain the complete solution.) 


(Z)) Equation solvable for y 

Sometimes it may not be possible to express yt in any convenient 
way in terms of y and t, but it may be possible to express y in terms 
of>'( and>'. In this case write 7* = v, so that we can write = f{v, t). 
On differentiating this with respect to t we obtain a differential equation 
for V (since jVt = v) which may be soluble: having obtained v in terms 
of t we can find y from the original equation. 


EXAMPLE 

(4) Consider the equation y = — ^y,^ = ^v^ — \v^. 

On differentiating with respect to t we obtain 

y^ V = V^Vt — VVf, 

i.e. 1 = {v — i) 

The variables v and t are now separated; and on integrating with respect 
to t we have 

t ±C^ = f (w — i) Vt dt 

= J {v — 1) dv = — z) + C2 

This gives us a quadratic equation for v 

— zv + (C2 — Cl — <) = o 
or writing C^ — Cj — 1 — c, 2 constant, 

(z) — i)2 -f (c — /) = o 
V = 1 ± — t) 

But we know that jy = §z ;3 — ^zj*, so that we obtain v in terms 
of /. 
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(jE) Equation solvable for t 

Sometimes t can be expressed in terms of yt and hut yt not in 
any convenient way in terms of y and t. But yt = ijt^, so ^at then t 
can be expressed as a function of ty and y \ and this is simply the pre¬ 
vious case (D) with the variables y and t interchanged. 

(F) The linear equation 

An equation of the form 

where P(t) and Q{t) are functions of t only, is said to be “linear”. It 
could arise biologically in the following way. We have seen that a 
population provided with ample food supplies will increase according 
to the law V( = Ky, where K is the “growth constant”. Now suppose 
that some external factor influencing the rate of growth, such as the 
temperature, is not constant but is a function of the time* Then the 
equation will become yt = y. P{t) instead of yt = Ky. Suppose 
further that there is a rate of immigration Q(i) which is also a known 
function of the time; then this must be added to the rate of increase 
of the population, which will become yt = yP(l} + 0(0- ^ linear 
equation is solved by the following special trick. Integrate F(/); let 
w be any one indefinite integral. Then by definition tVt = P{t). Now 

consider 

^ = ye-'^ 

Zt — yte-'’^ — ye-'^Wt 

= [yP{t) + Q(i)]e~'^-ye-'^Pii) 

= 

But is a known function of /, so that we can integrate this to find z: 

y = e'^z e'^ f Q{t)e-'° dt. 

EXAMPLE 

(5) y^ = yjt + qt {q constant). This corresponds to a population 
in which the natural rate of reproduction is being slowed down as ijt, 
but in which the rate of immigration is steadily increasing according 
to the law gt. Here P{t) -= ift and Q{t) = qt. Now one integral of 
P(/) is wj = In /, and e*^ ^ i, e-'^ = ijt. Therefore 

y = e'^ J Q{t)e-'^dt 
— t J qtit . dt 

^ t J q dt — t {qt C) 

'rhus if t is large the population will increase roughly as qt‘^. 
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(G) The Bernouilli equation y^ = y . P(t) + y^ . Q(t). 

This becomes linear on changing the variable to z = y^~'*. 

(H) The Riccati equation yt— y^ • Pit) +jy . ^(0- 

This can be solved completely if one solution can be found. Let 
Y be this solution, and write z = i/iy — Y): then it will be found that 
the equation for z is linear. 


EXAMPLE 

(6) = iy^ + - 2t)y + - t-'). 

One solution of this is y = i, as is immediately verified by sub¬ 
stitution. Set therefore z = i/(>' — i), then 

= —yt/iy — 0 ^ 

= _ ^2 (f-i _ 2t)y + (r - /-')] 

But y = 1 + z-'^: on substituting this value we find 

z, = — t-^z — t 

This is a linear differential equation for z. If ttJ = ( — t~^)dt 

= — In / then the integral is, as shown in paragraph (F) above, 

z = e'^ J i—t) e-'^ dt 
= /-* J (— P) dt 
= /-M- + Q 

= - -P Cjt 

whence y = i -f z~'^ = i + tfiC — Jz®). 
ij) The "‘exact” equation 

If we take any relation between y and t, and write it in the form 
/Cx. t) — o, then on differentiation we shall obtain an equation con¬ 
necting y, y, and t. Let /yirCv, 0 and /,]„ (y, t) be the partial derivatives 
of/(y, /) when y and t arc allowed to vary independently; then when 
y and t are subject to the constraint/(y, ^) — o we shall have/i(y, 0=®* 
j.e. by the rules for constrained variables 

/virO'. 0 O'! +/(ivO'. 0 == o • • (*015) 

We shall get the same relation if we start with /(y, t) == C, since then 
ftiy, t) = Ct = o. Thus if we have an equation of the form (10.15) 
its general solution will be /(y, /) = constant. Such an equation is 
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said to be “exact”. That is to say that if our equation is of the form 
M{y, t)yi + N{y, t) = o, where M{y, t) and N(y, t) are functions of 
y and t, then this equation will be exact if we can show that M[y, i) is 
the partial derivative of a function /(y, t) with respect to y, and N(y, t) 
the partial derivative of the same function with respect to t. It may 
not be obvious at sight whether this is so or not, and fortunately there 
is a test (discussed in Section 12.12) which enables us to find out. An 
equation M{y, t)yt + N{y, t) = o ‘\^ exact if and only if 

0 = 

If this is so we can reconstruct the function /(y, /) by methods to 
be given in the next section. If it is not so it is sometimes possible to 
multiply the original equation through by a function, known as an 
“integrating factor”, which will make it exact. Unfortunately there is 
no rule which enables us to pick out such integrating factors in general: 
but sometimes they may be guessed. 


EXAMPLE 


(7) yt -1- (3^ + 3'/0 = o- 

If we multiply through by t we obtain iyt + 3^- y — o, 
M(y, /) = /, N(y, 0 = $ 1 - + y. These evidently satisfy Mt\y == i = 
and the equation is now exact. 

In fact N{y% t) = + ty) and M{y, t) = + ty), so that 

the solution must be ty = C, or y — C/t — i-. 

Note. In our illustrations of the methods of solution of differential 
equations, the equations have been divided by various expressions 
whenever it seemed convenient without considering whether these 
expressions were zero or not. Strictly speaking we should have been 
more careful: but such details would have interrupted the argument. 
As a result of neglecting this precaution several special solutions of the 
equations have been overlooked. Thus in solving equation (10.8), 
Vf = Ky’y we divided by y. This will not be possible if = o: and in 
fact 3' = 0 is a solution, representing a population which is always zero 
and therefore can never grow. In solving (10.7), = — K(y ~ T), 

we divided by y — Y; but y — T = o, i.e. y = T is a possible solution 
of the equation of cooling, representing a state in which the temperature 
of the body is always equal to the temperature of its surroundings, so 
that there is no exchange of heat. In the same way the logistic equation 
3’, = Ay’(i — y/L) has solutions 3'— o, a zero population, and 
I — yjL = Oy y = Ly i.e. a population which has attained the limiting 
value and stays there without further change. The reader can, if he 
wishes, investigate the other equations in a similar way. 
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PROBLEMS 

Solve the following diflferential equations: 

(1) yt = y^ 

(2) = secy 

(3) y(^^ — Oyt = — 0 

(4) 2ytyt=y^~t^ 

(5) yt = fy- 

(6) yt = cot y . cot t 

6 ) yt = {y — t)i{y + ^ + 2) 

(8) yt = {t + i)ylt 1 — 

■ (9) yt = y^ 4- 2(1 — e-*^)ty +3' 4 - (i — 0 + — 2) 

(^y = t is one solution) 

(10) sin y . cos t 4- cos y . sin t . yt = o 

xo.xo Partial dififerential equations 

So far we have considered ordinary differential equations in which 
all variables arc expressible as functions of a single one. What happens 
when we have more than one independent variable, so that we have 
to deal with partial, not ordinary, derivatives? 

Suppose that we have a quantity y which is a function of two 
variables, say t and w, 

y=f(t, u) 

Then y will have two partial derivatives, one, v,. with respect to t 
keeping u fLxed, and one, y^, with respect to u keeping / fixed. These 
derivatives arc more usually written as dyj dt and dyjiu, or ft{ty u) and 
/u(/, u) respectively. Let us first suppose that we know the value of yt 
only: what then can we deduce about j? 

By definition yt is the derivative of y with u fixed. If then we take 
any particular value of u, say u — and, keeping u fixed, integrate 
with respect to t, we shall obtain the values of y for this value of n. 

y = f{t, M,) = f ytdt + 

where C, is a constant. Here t is allowed to vary, but u is not. 

On taking another value of w, say u = u^, we shall have in the same 
way 

y u^) = f ytdt ->r Cg, 

where is a constant. There is no reason why the two constants 
C\ and Cg should be equal; each of them can have any value we please. 
Our information only tells us how rapidly y changes with t when u 
IS fixed: it tells us nothing about what happens when u varies. 

Thus for each value of u we can say that y = f ytdt C, where 

the value of Cmay he different for different values of u. In other words 
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C is a function of «, and only of m; we might write it as <^(«), and so 
obtain finally 

y= J ytdt + ^(w) • • • 

In this the integration is understood as performed while u is kept fixed, 
and 4 >{u) is now an arbitrary function of u instead of a constant of 

integration. . , 

Looking at the matter graphically, suppose we ploty against u and j 

(Fig. 10.3), choosing t and u as horizontal co-ordinates in a plane, and 



Each section m =* constant can be imagined raised or lowered by a constant amount, 
as indicated by the dotted lines. If, however, one section t — constant is also given 

the whole surface is determined 


representing y by the vertical distance above the plane. The points 
corresponding to the values of y as t and u vary lie on a surface. Now 
a knowledge of y, is equivalent to a knowledge of the slope in any 
section of this surface by a plane ti = constant. In other words, we 
know the shapes of these sections, but we do not know how high each 
of them is above the (f, u) plane. If we are given one point in each of 
these sections then we can find the whole surface. For example, it is 
sufficient to know a single section t = constant, i.e. to know all values 
of y for a given value of t and varying values of u. 


EXAMPLE 

(i) Consider the equation Sy/?/= y, = m +/. This gives 
y . J (h t)dt + ^(h) = ut + U- + •fiu). Here we can take any 

function we like for ^(m); ut + ^t", ut -f- If" + 2«, ut + + 27-5» 

and ut f- if" + V[^“ “ I si^ tanh are all solutions. This can be 
seen on differentiating with respect to t to verify the original equation, 
for the partial derivative of ut -f- If- is (m + f), and the derivative 
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of the function ^(m) is zero when u is held fixed, since <^(«) is then also 
fixed. 


It will be seen from the above argument that if we are given both 
partial derivatives and we can reconstruct the original function 
y except for a constant C of integration to be added on. In fact we 
have more information than we need. From the value of jy, we can 

deduce that y = j yidt -Y- ^{u) — I {t, u) + ^(k). where I (t, «), the 

integral of the given function is known. Differentiation with respect 
to u shows that 

yu = “) + 

or ^„(«) = y^~ «). 

Now u) is obtained by direct differentiation, and is therefore 
known. It follows that if we know the value of y^ for any one given 
value of t we can determine the derivative unknown function 

4 >{u). We have then only to integrate to find 4 >y ^rid hence jy: 

du -YC 

Graphically this means that if we know jy, for all values of t and «, 
and also the shape (but not the height) of any one section t = constant 
of the y surface, then we know the shape of the whole surface; but it 
can still be moved up or down as a whole according to the value of the 
constant C. 

EXAMPLE 

(2) Consider again the equation y, = {u -h Oi suppose we also 
know that y^ = o when i = o for all values of u. It has already been 
shown in example (1) that 

y = Ut -Y + 4 >{u) 

whence ju = ^ + <t>M- 

Putting / = o, _yy = o, so that ^u(m) = o for all values of u. 1 his 
shows that ^(w) is a constant, C, say, and the complete solution is 

y = ut -Y -Y C. 


Any equation connecting the partial derivatives _y, and y^^ with the 
variables /, u and y is known as a “(first-order) partial differential 
equation”. 'Phe solution of such an equation will in general contain 
an arbitrary function in some way, just as the solution of an ordinary 
differential equation contains an arbitrary constant C. Many natural 
laws are expressible by such equations—e.g. the laws of flow of heat, 
or of electricity and magnetism. Unfortunately the methods of solution 
of such equations form a large and complicated subject, and we shall 
merely give a few simple and important examples here. 
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EXAMPLES 

(3) Consider the equation dyj 8 t = o, where y is a function of two 

variables t and u. • u • 

From (10.17) we have at once jy = ^(w), since jy* = o; that is, 

y is a function of u only. Conversely if jy = ^(«), yt = Sjy/ dt = o. 

This is a commonsense result; the equation yt = o means that when 
u is fixed the rate of change of y is zero, i.e. changing the value of t 
without changing u has no effect on y. 

(4) Consider the equation By/Bt = dyjdu or y^u — yu\t> where ^ 
is again a function of t and u. 

Here the device we use to solve the equation is to change our 
variables from / and M to ^ and j = / + u. By the equation for change 

of variables a +J«i«Wns- 

But ^t, — I, and since ti — s — iy Wti, = — x, so that 

yt\s=^yt\u - JuU = o (t>y hypothesis). 

By our previous example this means that jy is a function of r = X + 1^ 
only: 

y = + «)- 

(5) Find the value of _y = sin t . cos u + cos t . sin u. 

By direct differentiation Byj Bt — cos t . cos u — sin t . sin u. 
Similarlv cyj Bu — cos t . cos k — sin t . sin u = Byj Bt. Therefore 
by the previous example y is a function of {t -r «), y = sin t . cos ti + 
cos / . sin u — <i){t + »). To find what the function ^ is, put « = o. 
We obtain sin / . i -p cos t . o = sin t = ^(/). Thus 

sin t . cos u + cos t . sin u = sin (t + w)- 

'The reader should compare this method of proving the addition 
theorems with the direct method (Section 5.6). 

PROBLEMS 

(1) Solve the equation t. BylBt = u . cyj Bu by changing the 
variables to t and s ~ tu. 

(2) CJiveii that el In t;dt ^ i; 7 , and In i — o, show that In ^ + In « 
In {ill). 

(3) Using example (3) show that cos t . cos u — sin t . sin u = 
cos (/ -r u). 

(4) Solve t~^cylBt + ii-^dyjBu ^ o. 
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II. z Definite integrals 

Consider a point moving with a velocity which we take to be 
possibly varying, but to be a known function of the time. Then its 

position y at any time is given by jy = J v dti that is the definition of 

the “indefinite integral” J v dt. But we know that this integral is 

not uniquely determined by the velocity v. If is one integral, 

then Y{t) -1- C is also a possible integral 

y = J vdt = y(0 + C . . . (ii.i) 

We have to be told exactly where the point started from, as well as its 
velocity, in order to fix its position for certain. 

Suppose, however, we are interested not in the absolute position y 
of the point, but simply in the distance it has moved in a given time, 
say from time to time If y’l is its co-ordinate at time /j, and 
at time ig, then the distance travelled will be y^ — >’i- We should 
then expect this distance to be completely determined by the velocity. 
For when the velocity of a point is known, we expect to be able to 
deduce how far it has gone. In fact, sinceis by definition the value 
oiy when t = ^j, we see from (ii.i) thatyj = y(/,) + C, and in the 
same wayyg = Y{t2) + C. By subtraction we obtain 

distance gone = yz —yi 

= [V(f.) + c]-[y(/,) -t-c] 

= Y(t2) — Y{tj) .... (II.2) 

The unknown constant C has cancelled out. This formula can be 
expressed in words: to find the distance travelled between times t, atid tj, 
take any one indefinite integral Y(t) of the velocity v, find the values 
Y{t,) and Y(t2) of this integral at times ti and tj respectively, and finally 
subtract Y(t,) from Y(t2). 


EXAMPLE 

(i) A point is moving with velocity v = 2t. What is the distance 
it travels between times /, = i and t^ — 2} 




^57 



BIOMATHEMATICS 


258 

One possible integral of v is Y{t) — J zt . dt — Thus y(i) = 

i2 ^ Y(2) = 2^* = 4, and the distance travelled is y(2) — y(i) = 

3 units. _ . 

It should perhaps be made clear that this “distance —yi 

means simply the distance between the initial position 3^1 and the final 
position yz- It is positive if yz is on the positive side of y^y i.e. if 
yz > yi, and negative if y^ <yv H Poi*^I returns to the place it 
started from then the total distance covered is counted as zero in this 
formula. To evaluate the distance covered by the point in all its 
wanderings, not counting the signs positive or negative, we must take 

•the integral 
in practice. 


v\dt instead of J v dt. This, however, is rarely needed 


EXAMPLE 

(2) Suppose V = sin t. What is the change in position between 
times ^1 = 0 and /g = aw? 

y(/) = J s\n t . dt = — cos t. 

The change in position in therefore 

yz — = (— 0 — (— 0 = 

the point returns to its original position. 

This change in position yz —.Vi is called the “definite integral of 

T from time to time /o” and is denoted by the symbol J^ v dt. 

[This is read “integral t dt from to /o”]- Thus a definite integral 
means simplv a change in position or distance travelled: it is “definite 
because it does nt)t contain any unknown constant C, and it is an 
“integral” because it is intimately connected with the indefinite integral 

!'(/) = J z' dt. In fact 

P^vdt^ y(/,) - Y{t,) . • (II- 3 ) 

Here we must also introduce a second notation which is both very' 
convenient and in common use. 'I'his is to write y(/2)— I 
[!'(/)][ . Here the bracket symbol [ ][; surrounding the function 

i’(/) means “subtract the value of this function when t = ti from its 
value when t t.". d'hus [/-][; means /o- — and [3 sin == 
3 sin /o ~ 3 ^1- Iti this notation equation (11.3) is written in the 

compact form 

f - vdt-[Y{t)C^ 

• f I Cl 


(11.4) 
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The working in examples (i) and (2) above would be set out 
briefly as 

y"^ 2.1 dt = = 2^ — I* = 3 

sin t dt — [— cos = (— i) — (— i) = o 

J 0 

The values and t2oi t between which the integration is performed 
are sometimes called the “limits” or “termini” of integration. This is 
of course a sense of the word “limit” quite different from that special 
technical sense which is used in phrases such as '*dyjdt is the limit of 
hyj ht as ht o.” 

The definite integral can be applied to other rates of change besides 
a velocity, or rate of change of position. If v stands for the rate of 

increase of weight of an animal, then f ^ v dt will be the gain in 
weight between times and li v is the rate at which some meta¬ 
bolic product is being excreted by an organism, then f ^ v dt will be 
the total amount excreted in the specified interval of time. If / is the 
electric current in a wire at time t, then f ' I dt is the total quantity 
of electricity which has passed down the wire during the given interval. 

PROBLEMS 

Find the following integrals: 

(i) p^'^tdt\ (2) p^{i^^\)dt\ (3) f^^e^df, (4) p^te* dt\ 
(5) f " cos / dt. 

•J — W 

XX.2 Areas as integrals 

A more unexpected use of integration is that of calculating lengths, 
areas, and volumes. 

We are all used to the ideas of length, area, and volume in everyday 
life; we are accustomed to measuring the lengths of strings and wires, 
the volumes of liquids, or the area of a wall when it has to be painted. 
So we rarely think seriously about exactly what we mean by a length 
or a volume, and still less do we subject our ideas to careful mathe¬ 
matical analysis. In fact such analysis is normally scarcely ever required. 
If we have to find the length of a complicated path, such as a route 
traced out on a map, we can do it by running a thread along the path, 
straightening the thread out, and measuring it along a ruler. If we 
want to know the volume of an oddly shaped vessel, we can fill it with 
water, and then pour the water into a measuring glass. Such empirical 
methods are very useful for particular problems. But they do not give 
UB a general line of attack from W'hich W’e can obtain a formula for the 
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volume of a vessel of given shape, such as a cone, cylinder, or sphere, 
or the length of a given type of curve. It is the integral calculus which 
provides just such generality. 

First let us consider an area of the following shape. Suppose we 
have a plane in which there are two co-ordinate axes OX and Oy at 
right angles to each other (Fig. ii.i). We shall take OX to be the 



Fig. ix.i—An area considered as a deiinite integral 


axis of .V, and OY the axis of }\ in the usual way. Suppose further 
that we have a curve C in this plane which is the graph of some function 
y = /(.v): for the present we shall suppose that C lies entirely above 
the .v-axis. Let P (a*,, Vi) and O = (.Vg, Va) he two points on this 
curve. Draw PU and QV perpendiculars from P and Q respectively 
onto the .v-axis; suppose further that U is to the left of V, i.e. that 
.v'l < X.,- I’hcn we shall prove that the area A contained between the 
straight lines PU, Ul\ I'Q and the cur\^e QP is simply the definite 

integral /(-v) dx. 'Fhus, for example, suppose that the curve C 
was simply the straight line v = x, so that /(.v) = .v, UP = yi = .Vi, 


VQ --- Vo - ■ ^*2. and A - = x dx — ~ ^ — i .Vj®, if the 


above formula is correct. I'his agrees with the usual formula for the 
area of a quadrilateral with two parallel sides, which is usually stated 
in textbooks as A - - \ (distance between the 2 parallel sides) X (sum 


of the lengths of the parallel sides) 


- i UV X {UP -f VO) = i (.Vo - .V,) (.Vo + X,) 

== i {x^ - xr). 


To see whv this formula holds we first notice that the value of the 
area A depends on the points P and Q we choose on the curve. If P 
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and Q are imagined as moving along C, then the perpendicular PU 
or QV will also move, and the area A will alter. Suppose that P is 
kept fixed, but the right-hand boundary of the area is allowed to move. 
Instead of taking QV as this boundary, let us take where R is 

a point (.V, >*) on the curve, and W the point below it on the x-axis: 
we shall suppose that R is free to move up and down the curve as we 
choose, but keeping always at or to the right of P. VVe shall now con¬ 
sider the area PLIWR, which will be called a. This area will depend 
on the position of W. For when we are given W we can draw a vertical 
line to intersect the curve at R, and so complete the area PUWR. Now 
W is the point (x", o); so that the area a is a function of x only, and can 
therefore be written as a(x). 

There are two particular cases of importance. If W is moved along 
to coincide with F, then R coincides with Q, and the area PUWR=a{^x^ 
becomes identical with PUVQ = A. So, since now x = Xo, 

“(•*^2) = A . . . . (i 1.5) 

On the other hand, if W is moved down towards U the area a shrinks, 
and finally as W coincides with U the area becomes that of PUUP, 
or zero. This is the case when x = Xj, so that 

a(Xi) = O 

Combining this equation with (11.5) we can evidently write 

A a(x2) — a(-v,) . . . (ii-6) 

Now consider what happens to a as x varies. Suppose that W is 
moved to the right to a new point W = (x', o), so that R moves to 

and the area a changes from a(x) = area of PUWR to 
a(x') = PUW'R’. Then (Fig. ii.i) the change in x is 

hx = x’ — x = OW' — OW^ WW' 
and the change in the area a is 

Sa = a(x') — a(x) area PUW'R' — area PUWR = area RWW'R', 

Now draw RH perpendicularly onto W'R’. Then 

area of the rectangle RWW'H = WR x WW' = y Sx 

In the figure the curve C lies above RH, and we see that the difference 
between the areas RWW'R' = Sa and RWW'H = yhx is the area of 
the triangle RHR'. Now when Sx is small, so that W' is near W, this 
triangle will have a very small area—small even in comparison with 
the rectangle RWW'H. That is to say that the areas RWW'R' and 
RWW'H are very nearly equal when hx is small: 

ha. <=^ y Sx 

and the smaller Sx is the more nearly this will be true. A little thought 
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will show that this will be true for any reasonable shape of the curve C: 
the areas RWW'R' and RWW'H will always be nearly equal whether 
C lies entirely above RH, entirely below it, or partly above and partly 
below. In all cases 5 a y Sx, and the smaller 6.r the smaller the error 
will be in comparison with 5 .v. But we can write this equation as 
Ba/Sx — y; and if we take the limit as 8.r ^ o we find that dajdx = 

y — /W- 

Expressed in another way this means that a = a(^) is an indefinite 
integral of y = /W* Equation (11.6) shows that A is the difference 
between the values of this indefinite integral when x = x^ and a: = x^; 
that is to say that A is by definition the definite integral 

A ^ f"^ydx= f"^f{x)dx . . (11.7) 

J Xi ^ Xi 

The are.a A under a curve C is the integral of the ordinate y. 

The importance of this result is two-fold. Firstly it shows how to 
calculate an area. Secondly it gives us a method of picturing an integral. 
Suppose, for example, that we know the velocity of a moving body 
at time t. Then the distance moved between times /j and will be 

f V dt. Now draw the graph of v against t, v being plotted 

vertically and t horizontally (Fig. 11.2). Then the distance moved will 



Fig. ti.2—Distance travelled expressed as an area under 

the velocity-time graph 

be equal to the vertical area (shaded in the figure) under the curve 
between the points and t^y bounded above by the curve, below by 
the /-axis, and on the left and the right by vertical lines through the 
points (/,, o) and (/o, o). The only question which arises here is that 
of the unit of area. If the scale divisions along the / and v axes are 

each equal to the ordinary unit of length, then the integral J ^ v dt 

will be equal to the area in ordinary units. If, however, we use for 
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convenience some other scales, then, in order to agree with the integral, 
the area will have to be measured in units equal to the area of a rectangle 
whose width is equal to one i-scale division and whose height is one 
t^-scale division; for such a rectangle represents the distance travelled 
in unit time with unit velocity. 


EXAMPLES 

(i) Find the area under the parabola y = x“ between the vertical 
lines a; = JCj and x = X2 (Fig. 11.3). 



I’he area 

A = y dx — x^ dx 

(2) Find the area under the rectangular hyperbola^ = i/x between 
the vertical lines ae = i and x = X2- 

The area A = { ^ ~ [In x]\-’ ~ \n Xz- 

J1 X 

This property is sometimes taken as the definition of a natural 
logarithm; it is fairly easy to deduce from it all the other properties of 
the logarithm, including our definition by means of a spiral. 

(3) Find the area of a semicircle. 

A semicircle can be regarded as the area above the a:-axis contained 
within the curve x"^ y^ ~ which is the equation of a circle centre 
O and radius R. The value of _y for points above the a:-axis is therefore 
\/{R} — x^), with the positive value of the square root (Fig. 11.4). 
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Fig. 11.4—The area of a semicircle 


Furthermore the values of x range from —R to R. Thus the required 
area is (see Table 10.2) 


A = f” „ VR^ - dx= \ [i?- sin-> (xjR) + x VR^ - 


= l[R- . i + o] - hlRK-h^) + O] - ^ 7T /?2 


By doubling this we find the area of the whole circle to be ttR‘. 

In the same way we can show that an ellipse x'^far + = i of 

major axis of length 2a and minor axis of length zb has area irab. This 
is almost evident from the fact that the ellipse is a compressed circle. 
The original circle has radius a and area na-; it is compressed vertically 
in the ratio bja, so that the resulting ellipse should have area -n-a- . bja = 
•nab. 


PROBLEMS 

(1) Find the area under the curve y = \/x between x = o and 
X = 2. 

(2) Find the area under the curve y ~ beuveen a: = o and 

.V = I. 

II. 3 Calculation of areas by the summation of thin vertical strips 

There is a second and alternative way of thinking of the calculation 
of an area or of a definite integral. 

Suppose again that we wish to find the area A under a curve 
r /(.v) between the vertical lines x — and x = and above the 
.v-axis (Fig. 11.5). We have already shown that this area is the definite 

/ T^ 

y dx. But, instead of performing the calculation in that 
way, let us suppose that the area is divided up into small strips by a 
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number of vertical lines. If, for example, we div'ide it by lines through 
the points X^, . . . + i = x^, then it will be divided 

into n strips: Fig. 11.5 illustrates the division into six strips by seven 
vertical lines (including the left-hand and right-hand edges and five 
intermediate lines). Let us suppose that the corresponding values of 
y on the curve are Fg, Y^, . . . Fn + i = y^. 



Now if the first strip is sufficiently narrow we can approximate to 
it by a rectangle of height Kj, shown shaded in Fig. 11.5, by drawing 
a horizontal line across the strip through the first point P on the curve. 
There will be a small difference between the area of the strip and the 
rectangle: but if the strip is sufficiently narrow the error will be 
inappreciable. Similarly by drawing a horizontal line at height Y^ 
through the second point (A'g, Fg) on the curve, we can find a rectangle 
in the second strip approximating to its area. By continuing in this 
way we can place in each strip a rectangle approximating to its area. 

Let us add the areas of all these thin rectangles together, and call 
the sum S. Then S will be an approximation to the area A \ and the 
more strips we have, and the thinner they are, the nearer S will be 
to A. 

We can write down an expression for S. The first rectangle has 
height y, and width X2 — -V,; its area is therefore F, {X2 — X^). 
'rhe second rectangle has height Fg, width X^ —Al’g, and area 
F2(Ar3 — AT2). 'I’he further rectangles follow in a similar way, so that 

5 = Y,{X 2 -X,) f F 2 (Ar 3 -^ 2 ) + . - • + Y,XX„,,-X^) (11.8) 

If we agree to write X2 — AT,, the width of the first strip, as simply 
SA',, and X^ — X^ as SA'g, and so on, then (11.8) can be w’ritten as 

.S’ = F.SAT, + FgSA'g + FgSA'a -I- . . . + F„SJf„ 
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or in a very convenient notation 

5 = -2:(ySA') .... (11.9) 

where the capital sigma is simply an abbreviation for “the sum oP*, 
so that S{YhX) means “the sum of all quantities of the form YhX*\ 
i.e. Yn^Xji, 

If now we take more and more strips, and make them thinner, then 
the sum S will approach A ; or more precisely, A is the limit of S as 
the maximum width of any strip tends to zero. This is practically 
common sense; a formal proof that the limit of S exists and is equal 
to the definite integral will be found in textbooks on the Theory of 
Functions. Notice that in this definition there is no need for all the 
strips to be of equal width, provided only that the width of the widest 
strip is made to tend to zero. However, for all practical purposes it 
is sufficient to consider only the case of strips of equal width, and in 
that way to simplify the formula. We shall then have 

8X^ = 8Xz= . , . = SX„ = 8 (say), and 

y^s + y^s + ... + y„s 
= s.(y, 4- y, 4-... + y„)z= 8.i;y. 

Also since there are n strips of width 8 each, the total width of the 
area must be «8 and this must be equal to (a'j — x^). Thus 8 = 
(jc, — Xi)/n and 

^ = litn 5 = Urn (^2 — Xi)(Yi + Ta + • . + Y^)ln (ii.io) 

CO n—► CO 

since as « 00 the width 8 = (jcg — ^i)/« of each strip tends to zero. 

This provides an alternative method of calculating an area. 

11.4 Arithmetic series 

The basic formula (ii.io) can only be applied in practice if we have 
a method of evaluating sums of the form Tj -f- Tj + . . . -H Yn^ 
Suppose, for example, that it is to be applied to find the area A under 
the straight line y — a where a and ^ are constants. Then 

y, = a 4- I'a = a + Tg = a 4- ^X and so on; and 

yi 4 -y 2 + -- - + ^n“(“ H“^Aj) 4 -(a 4 -^A' 2 )+ . . - +(ct ^Xn) 

— no. 4 “ ^{Xi + 

Now A'l, Xz ■ . • Xn are equally spaced points: X^ — X-i = X^ — 
Xz — X^ — Xz = . . — 8. The numbers X^, X^ . • • X„ are said to 
form an “arithmetic series” or “arithmetic progression”. Since such 
arithmetic series are of fairly common occurrence, their properties are 
worth mentioning for their own sake. 

Consider therefore a series of equally spaced numbers X^, Xz, 
X3, . . . such that Xn — X^ = Xq — Xz = . . . = 8. (Examples are 
1, 2, 3, 4, ... , with 8=1, and i, 4, 7, 10. 13, ... , with 8 = 3.) 
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This Spacing 8 is known as the “common difference” of the series. 
We then have 

JC 2 = S 

AT3 = 2 “1“ ^ ~ (-^1 "f" “H 8 = +28 

JC^ — JC3 -f- ^ = (-^1 “i“ “ 1 “ ^ = ^1 3 ^ 

JC^ — AT, + 8 = (ATj + 38) -f- 5 = ATi + 48 

The general formula for the rth term X, is evidently 

ATr = A'l + (r - i) 8 . . . (II.II) 


PROBLEMS 

(1) An arithmetic series has first term 5 and common difference 3. 
What is the rth term AT^? If there are 7 terms in all, what is the last 
term? 

(2) An arithmetic series of 10 terms begins with 5 and ends with 
104. What is the common difference, and what are the terms of the 
series ? 


Suppose now that we wish to find the sum s = X^+X • • • +A’„ 
of such an arithmetic series. We shall illustrate the general process 
on the particular series 3, 5, 7, 9. This is shown in Fig. 11.6: the first 



Fig. 11.6—The sura of ao arithmetic scries 


term, 3, is represented by the line of 3 black circles joined together; 
the second term, 5, by a line of 5 black circles, and the remaining 2 
terms by lines of 7 and 9 black circles. The sum s will then be equal 
to the total number of black circles. 

Now add to this scries an equal series written in reverse order: 
94-74-54-3, This is represented by the white circles in Fig. 11.6. 
The total number of circles, black and white together, will now be 2s. 
But we now find that every row is of equal length. The first row is of 
length 3 4-q = 12. In the second row the number of blacks is increased 
by the common difference 2, while the number of whites is diminished 
by 2. Thus the total is unaltered. The same applies to the third row, 
and 80 on throughout. Thus the circles are arranged in a rectangular 
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array, and the total number zs can be calculated by multiplying the 
length of a row by the number of rows, i.e. 12 X 4 = 48. Dividing 
by 2 we find s = 24. 

If we repeat this process with the general series -f*-^2 + • • . -h^n 

we shall again obtain a rectangular array. The length of the first row 
will be Xi 4 - X„, the sum of the first and last terms (represented by 
the black and white circles respectively). The number of rows will be 
the number of terms, n. We have therefore 

zs — number of circles in rectangle = n(Xi + X„) 
whence s = n . ^(X^ + Xn) . . . 

or in words the sum of an arithmetic series is the product of the number 
of terms into the average of the first and last terms. Since Xn = X^ + 
(n — i)S this can also be written 

5 = [aA^j + (« — i)S] . . . (11.13) 

These formulas are very useful. For example, let us take the simplest 
possible series i 4-24-3 4 - 44 -** - 4-w. Applying (11.12) we find 
its sum to be n x average of first and last terms = \n(n 4- i)- 

Another interesting series is that of the odd numbers i 4-3 4*5 4 - • • • 
with common difference 2. The successive sums are then simply the 
squares in order: 1 = 12^ i 4- 3 = 4 =s 2^, i 4-34-5 = 9 = 3®* To 
show this in general for n terms we apply formula (11.13) with first 
term A”"! = i and common difference S = 2. We get 

s = hi [2 + 2 (n — i)] = • 2« = 

This is illustrated in Fig. 11.7, where we see that 14-3 fills a square 

of side 2, 1 3 4- 5 a square of side 3, and 

I 4-34-54-7a square of side 4. 

Now let us apply this formula to find the 
area under a straight liney =: a 4- between 
ordinates at .Vi and X2. We have already 
shown that if we divide this area into n strips 
of equal width by ordinates at ATj = .Vj, A'jj, 
A'a, . . . , A’’„4 , = .V2 then the area is approx¬ 
imated to by the sum (formula ii. 

5 = (.Vn — .Yj) ( 1 '^, 4 - 2 4 " • • • 4 - T„)/;i 

and 

-1- 2 "h ■ . ■ 4 “ ^ n = 4 - ^ (A 1 4 " ATg 4 - - - • 4 - X„) 

But bv (11.13), 

A| 4 " A 2 4 ~ • ■ • 4 “ A„ — [2AI 4 “ (^* — ^)^] 

Furthermore since there are n strips making a total width of — Xj 
the width of each strip, or common difference 8, is (a; 2 — Xj)/n. Thus, 
combining these results, 




Fig. 11.7—The sum of 
successive odd numbers 
is a square 
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Xi + X2 Xn = \n [aA'j + (« — i) (xg — :Vi)/«] 

= in [2a:i -f (^^2 — Xj) (i — ijn)] 

= [•’Cl + .To — (t 2 — Ti)/n] 

Y\ + y 2 + ■ • • + Yn = n { a + [jCj 4- Tg — (To — T,)/n]} 

5 = (T2 — Ti) { a + -^/S [Ti + T2 — (to — T,)/n] } 

= (T2 — T,)a 4 ^(T2=^ — Ti=)^ — ~ 

Now the area A is actually the limit of this sum as n, the number of 
strips, tends to infinity. The first two terms in the expression for S 
do not contain n, and will not be affected. But as n ^ 00, i/n —> o, 
and therefore the third term — — x^Yjn will also tend to o, so 

A = lim S = (t 2 — Ti)a 4 ^(.Vo^ — 

The reader can compare this method of finding the area with that of 
direct integration 

f^^ydx= rUa-\-^x)dx 

Jxt J Zi 

= [or 4 

= [oTa 4 i) 3 T 2 ^] — [oTj 4 
= a(T2 — T,) 4 h^X^^ — Tx^). 

This gives the same answer, and is evidently the simpler and quicker 
way of performing the calculation. But, as will be seen later, the 
principle of the strip method is of great importance. 

PROBLEM 

(3) Sum the series i 45494134... to « terms. 

The sigma notation 

We have already noted the use of the Greek capital sigma T as a 
sign of summation: Tt means “the sum of ail quantities t“. 

If we wish to be more precise we denote a sum like Tj 4T2 4 

n 

T3 4 . . . 4 T„ by the symbol ^t,., read as “sigma from i to n (of) 

r — I 
n 

Xf . That is, ^ means “sum for all values of r beginning with i and 

I 

n n 

ending with n**. 'I’his could also be denoted by ^ t,, or ^Tq. All 

1=1 asi 

these mean exactly the quantity t, 4 Tj 4 . . . 4 and usually it 

n n 

Will be sufficiently clear to write simply £ x^, or H Xa, or even frequently 

I t 

just S Xa. (Sometimes the letter S is used instead of T.) 
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In this book we shall almost invariably follow an excellent conven¬ 
tion, suggested by Dr. C. A. Rogers, that any suffix which is summed 

n 

over in this way will be denoted by a Greek letter. Thus 2 Xa will 

X 

mean JCi + ^2 + . • + will mean Xi^ + x^^ • + x„^ (and 

1 

n « 

must be carefully distinguished from [2 the square of ^ Xat 

t I 

n 

which is read as “sigma from i to n of Xay all squared**), and Z oxa will 

I 

mean i x^ + 2x3 + 3X3 + . . + wx„. The effect of this convention 
is that complicated expressions can be much simplified: thus 2 Xan 
where there are two suffixes, is to be summed over the Greek a, but not 
over the Latin r, i.e. it means Xj,. + Xgr + JCar + • • • ^act one can 
with sufficient care even leave out most of the sigma signs, writing 
Xa or .Xa® and leaving the Greek suffix to indicate that these are summed 
as 2 J Xa or £ Xa®. But we shall not do that here.) 

This use of sigma explains the notation for an integral. We have 
seen that an area A can be considered as the limit of a sum 2 YBX, 
where Y denotes the ordinate in a strip and SA" the width of the strip. 

When we take the limit we obtain the integral denoted hy J y dx. 

Here the sigma is replaced hy a J y which is simply a form of the letter 
iS standing for Sum, and SA’ is replaced in the limit by dXy as in differen¬ 
tial calculus. Although the symbols J and dx have strictly speaking 
no meaning in isolation they serve to remind us that the integral can 
be approximated to by a J um of small ^/ifferences or small strips. 

The great advantage of the sigma notation is its compactness. But 
this compactness is bought at the price of making the sjTnbolism a 
little harder to comprehend. It is easy to take in at a glance the meaning 
of the series (i -}- 2 -f- 3 -f . . . + «); it is less easy when it is written 

n 

as 2] some practice is needed in getting used to such expres- 

a I 

sions. We therefore give some formal rules of operation which may be 
helpful. 

n 

The expression T(.Va + Ja) means (.Vj -hyO - 4 - (xo +3^2) + • ■ • 

1 

+ (x„ 4 - 3 'n) when written out in full. But this can be rearranged as 

n n 

(vVi + <^2 + • ♦ • -T -Vn) (^*1 “i“ 3*2 “!“••• H' 3 n)> Of Xa “f" ^ ya> T huS 

1 1 

ti n n 

T(.Va -fya) == T.Va + iTja • • (1I.I4) 

t t 1 
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In the same way 

n n ft 

S (^Xa — ^ ^ ^ ya 

t 1 1 

+>'a + Za) = ^2:Xa+ ^ ya + ^ Za 
X ‘ I J I 

^ (Xa + + Xa^) = i7 ATq + ^ Xa^ + Xa^. 

X 111 

ft 

If k is a constant (independent of-a) then 2 ^ (kxa) means kxi + kx^ -f- 

I 

n 

. . . + kx„y or ^(jCi + Jf2 "H • • • + x„), or k Xa- Thus we have the 

t 

rule for multiplication by a constant: 

ft n 

Z kXa = k Z Xa . ( 1 I.I 5 ) 

I 1 

If instead of multiplying by a constant we add it, we see that 

fl 

S {k Xa) means {k + a'i) + (^ + -Vg) + . . . + (A + which equals 
+ (jCi + *2 + • - - + •x'n) — nk + ZXa- Thus 

X 

Z(k + ATa) = + ZXa - - . (11.16) 

I I 

These relations will be sufficient for our present purposes. Certain 
other important ones are given later (Section 13.14). From our relations 
we can infer such equations as 

iT (2 + 3X« + .V(X^) = 2« + 3 27 Jfa + -^ JCa*. 

I I I 

XZ.6 Generalized arithmetic series 

It is sometimes necessary to find the sums of more complicated 
series, such as i* -f- 2-* + 3® + . . . + where the rth term is r^, or 
I 4-3 + 6 + 10 + ... + i«(« 4- i), where the rth term is \r(r i). 
We may call a series one of “generalized arithmetic type” if the nh 
term can be expressed as a polynomial in r: 

x^ A -\- Br + Cr~ + . , . 4- //r'*. 

In an ordinary arithmetic scries the rth term is Xr = X^ 4- (r — i)S = 
— ^) 4- Sr, so that this is included in the form A 4- Br with 
^ = X^ — 8 and B = h. These series do not occur so often in general 
biological work, but are sometimes needed in statistical calculations, 
and it seems convenient to consider them here. 
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The key to the problem of summing such series is contained in 
the follo\ving propositions: 

n 

I + 2 + 3 + . . = = \n{n + i) 

1 

I.2+2.3+3.4 + .. +K(n + i) — 2 a(aH-i) = \n(n-\-i)(n+ 2 ) 

X 

I.2.3+2.3.4 + 3.4.5 + .- - + M(nH-i)(K+2) 

= 2'a(a + l)(a+2) = iH(«+l)(«+ 2 )(«+ 3 ) - - (n-I?) 

! 

and so on. 

To see how these formulas arise let us prove the second one: this 
will illustrate the general method of proof, which can equally well be 
applied to the others. We want therefore to find an expression for the 
sum 

^ = I . 2 + 2.3 -t- 3.4 + . . • + (« — + «(« + 0 

First multiply by 3, obtaining 

35 = 3(1 . 2) -r 3(2 . 3) 4 - 3(3.4) -f . . . + 3(« — i)n + 3 n(n + i)- 

Kow 3= — I + 4= —2+5=... = — (m — 2 ) 4 -(«-f'i) = 
_ 4- 2), so the expression can be rewritten as 

3^=1 . 2 . 3 + (-1 -i 4) • 2 • 3 + (-2 + s) . 3.4 + - . . 

-r[—(«—2) 4 - (?/ -^i)](« — i)h+[—(«—!)+(«+ 2)]n(MH-i) 

= 1.2.3 — 1.2.3 ’-2.3.4 — 2.3. 44 - 3 - 4 * 5 “"--* 

—(h— 2)(7/ -1);/4-(« —+ (« —i)«(« + i)+«(«-ht)(«+2) 

= h (77 4 -i)(«+ 2 ) 

since the first two terms cancel out, so do the ne.xt two, and so on in 
pairs, and only the last term is left. Dividing through by 3 we finally 
obtain the required answer 

5 = ^u{n 4 - i)(« -i- 2). 

PROBLEM 

(i) Repeat the procedure with the series 
1 -{- 2 -^ . . . 4- w and I.2.34-2.3. 44 -.•• + «(« + i)(« + 2)* 


A sum such as + 2- 4 - 3" 4 - • • • 4 - w" can now be found by the 
following device. 
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= I . I = 1(2 — l) = I . 2 — I 

22 2.2 -- 2(3 — l) = 2.3 — 2 

3" = 3 • 3 = 3(4 — i) = 3 • 4 — 3 

and in general 

a* = a . a = a [(a + 1) — l] = ci (ci 4 * l) — 

This rearrangement splits the original series into two series whose 
sums have already been found. 

Thus, with all summations running from i to w, 

S = .27 a (a -j- l) — .27 a 

= + i)(” + 2) — i«(n +-0 

= n(« + i) [i(« 4 - 2) — 

= n{n 4 - 1) [i • 2(n 4- 2) — i . 3] 

= Jtt(n 4- i) [2(« 4- 2) — 3] 

= Jn(n + i)(2?i 4 - i) 

i.e. i2 4- 2^ + . . . -\- = in{n 4 - i)(2« 4 - O- 

The same device can be used to sum i® 4- 2® 4 - 3^ 4 - • • • 4- Here 
the general term a® must be expressed in terms of a(a -f- i)(a + 2), 
a(a 4- i), and a, which are series we already know how to sum. Now 

a(a 4 “ l) — 4 " ct 

a(a 4 - l)(a 4 - 2) = a® 4 - 3a® 4 - 2a. 

Since a(a 4- i) and a do not give terms in a®, we must take the 
expression a(a 4- i) (a 4- 2) to give an a®, and we can write 

a® = a(a 4“ l)(a 4-2) — 3a® — 2a. 

But we have already seen that a® = a(a 4 - i) — a, so that substituting 
this in the above expression we find that 

a® = a(a 4 * l)(ct 4“ 2) — 3«(“ -f- l) 4 " « 

and therefore 

i 7 a® = i 7 a(a 4- l)(a 4 - 2) — 3 27 a(a 4 - I) 4 - -S'« 

= i«(n 4 - i)(« 4 - 2)(« 4 - 3) — 3n(« 4 i)(« 4 - 2) 4 - i«(H + i) 

= i«"(« 4 - i)" 

We can continue in this way to find the sums of higher powers. 
The calculations are straightforward but laborious, and lead to the 
following results. 


T 
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Table ii.i—Sums of Powers 


1 2, n — 

+2== H- . . . + W- = 

l3 23 + . . , + = 

+2^ + . . . + = 

I® + 2® + . . . 4- = 

i« + 2« + . . . + n» = 

l’ + 2’ + . . . 4- «’ — 

I® + 2® + . . . +7/® = 


4 «(i + n) 

l(?i + h2 ) 

^n(i + »)(i + zn) 

h{n + 37*2 + 271 ®) 

Mi +«)" 

l{n^ + 271 ® + 71 *) 

5Vn(i + 7 i)(i + 2n)(— I 4 - 371 + 371®) 

M— « + lO”® + 15”* + 6 m®) 

yV«^( I 4 - m)®( — I 4- 2M 4- 271 ®) 

lV(— + 5M^ + 671® 4- 27 i*) 

^ 7 l{l 4 - 70 (i + 27 i)(i — 371 4 - 671® 4 * 3 ”^) 

5 ir(« “ 7 K® 4- 2171 ® 4 - 2 IM® 4- 671’) 

^\n%i 4 - «)‘*(2 — 4M — 71 ® 4 - 671® 4 - 3 ”^) 
^V(2M® — 771* 4- 1471* 4- I 271 ’ 4 - 3W®) 

^ 7 i(i 4 -«)(i 4 - 2 n)(—3 4-9W—«“— 15 «^+ 

57i*4-i5«®+S«*) 

y\5(—371 4 - 2071 ® —42M® 4 - 6 o 7 i’ 4 - 45 «® 4-10M®) 


For further information about the properties of these sums, and 
for a simpler method of calculation, see T. J. FA. Bromwich, An Intro¬ 
duction to the Theory of Infinite Series (2nd edn., 1931, Macmillan), 
Chapter XI. 

One curious identity emerges from these formulas: 

(i® + 2® 4- 3^ -h . . ■ 4 - 71 ®) = (i 4-2+3 + . . . 7 i)®. 

This is illustrated in Fig. 11.8, where we see that a cubical arrange¬ 
ment of objects of side 2 can be cut into 3 rectangular slices, 1X2, 
2 X 2, I X 2. By rearranging these slices as in the lower half of the 
diagram we can miike i® + 2® objects fill up a square of side 1+2. 
Similarly a cube of side 3 can be cut into slices i X 3, 2 X 3, 3 X 3 > 
2 X 3, I X 3, and by adding these on to the square of side 1+2 we 
get one of side i -^-2 4' 3. 

These generalized arithmetic series can be used to find the area 
under the parabola y — x- between the ordinates = o and :i2 ~ i* 
Let this area be divided into n strips of equal width. Then by (ii.io) 
the area A is the limit of the sum 


5 - (x, - .vO(F, 4- Fs + . . . + FJ/Vi. 

Now X2 — ‘V, = I, and ~ A'l®, Fg = -^2^1 • • • F„ — ATn®. Thus 

5 == (A’r + A%® + . . . 4 A„®)/7 i. 


1 
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o—o—o 

m 



Fig. IX.8 —The sum of successive cubes 


Now if S is the width of each strip, X^ = + S = 8 since Xi = o. 

Similarly X^ = 2S, X^ = 3S, and X„ = (n — i)S. So 

S = (SV«) . [i"' + 2“ + . . . + (« - lYl 

But by replacing ti by (« — i) in the formula for i* + 2^ + . . . -p n® 
we see that 

I* + 2^ + . . . -I- (« — l)* = i (« — l)«(2« — l). 

Furthermore since the total width of the area is i, and it is divided 
into n strips, the width of each strip must be 8 = i/n. Making these 
substitutions we find finally 

S = • h(« — i)"(2» — i) 

Now, as fi -> 00, ifn o, and therefore iS ^ . i . 2 = This 

is the required area. 

Alternatively by direct integration we have 

A= f' ydx= x^dx = = J ~ o = J 

•'0 •'0 

(Again the integration method turns out to be simpler.) 
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The sum of any generalized arithmetic series can now be easily 
found. Suppose that the rth term of the series is Xr A Br 
Cr^- + . . . + /fr*. Then 

Exo. = E{A -\~Ba + Ca^ . Ha^) 

= flA -f- BEa "I- CEa^ -j- . . . -f- HEa.^ 

and the values of the sums Ea^ Ear^ ,.. Ea^ can be read from Table ii.i. 

EXAMPLE 

(i) Find the sum to 71 terms of the series i, 7, 19, . . . whose rth 
term is = i — 3r + 

We have 

E Xr ~ n — 3 Ea 4- 3 Ea^ 

= « — 3 • + 3 • + 3”^ + 2n®) 

~ n ~ 4- w^) + \{n + 3«’^ + 271 ^) 

= 


PROBLEMS 

(2) Find the sum of the series 5 4* 23 + 53 + ... to « terms, 
given that the rth term is 6«^ — i. 

(3) Find a formula for the rth term of the series 2 + 10 4- 24 + 
44 + 70 + 102 4- 140 + . . . , and for the sum to n terms. 


IZ.7 Integrals as the limit of sums 


We have seen that an area can be considered either as an integral 
y dx or as the limit of a sum EYhX. The importance of this 

J *. 


result does not lie in providing alternative methods of calculating an 
area, for, as we have seen, the definite integral is almost always much 
easier to calculate than the limit of the sum. It lies rather in showing 
that the two e.xpressions are equal, so that whenever a quantity can be 
expressed as the limit of a sum like EY8X we know that it must be 


the integral J y dx. 


Consider the problem of finding the work done in stretching an 
elastic cord or spring from length .Vi to length Xa. If / is the force of 
tension in the spring at length x, then / will be a function of .v. (In 
general the greater the length .v of the cord the greater the tension /.) 
Now imagine the extension done in a series of small steps; first from 
length A', = x^ to length A’2, secondly from length X^ to A’3, thirdly 
from A'3 to A'4, and so on (ending at Xn+i = .V2). Let the tension at 
the beginning of each of these steps be Fj, F^ . . . respectively. 
Then if the first step is sufficiently small the tension will be approxi¬ 
mately F, throughout, and the work done in extending the length by 
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BXi = X2 — Xi will be very nearly FjSA'j. In the second step the 
work done will be nearly equal to F^^X^, where hX^ = X 2—X^’. and 
in the whole extension the total work will be £FaBXa nearly. The 
smaller the steps the better the approximation will be. In the limit as 
the greatest step 5 ^ tends to zero the work must be exactly expressed 
by the integral 

W= r^fdx. 

If the cord obeys Hooke’s Law, f= A Bx where A and B are 
constants, then 

W = {A + Bx) dx 

= \Ax + lBx^]l^^ 

= [Ax2 + ^Bx^) — [Axj + 


From now on we shall abandon the use of capital letters in the 
sum £Y8X; we only needed them to distinguish between the limits 
of integration Xi and X2 and the boundary points of the strips X^, 
X2, . . , Xn + i. We can put our rule in the form 


1 / a small change 8Q in any quantity Q can be expressed approximately 
in the form 8Q cs ySx, where y and x are other quantities functionally 
related to Q, then, when x changes from x, Xj, Q changes by the integral 

/!“ y 

•y soi 


For instance, any force F acting along a line will do work W = 

F dx in moving a body from position x, to Xo in this line: for 

the work done in a small change of position Sx is 81 V ~ F 8x. If the 
force itself does not act in the line in question the above formula will 
still hold, provided that we take I' to be the component of the force 
along the line. If we plot the graph of F against x, then the work done 
will be represented by the area under the curve. 

Consider a gas at pressure P and volume V. I'he work done by 
the gas in expanding by a small amount 8V will be approximately 
P8V. Thus the work done in expanding from a volume K, to a volume 
y2 approximated to by the sum £P6V, or in the limit it is exactly 

expressed by the integral J^^'‘ P dV. If the gas obeys the law PV = 

nJRT and the expansion takes place at constant temperature T, then 
F = nRTfV, and the work done is 


W = fiRTjV . dV = [nRT In F] 
= nRT{\n — In F,). 


If a body is moving with velocity v at time t, then the distance 
by travelled during a short interval of time ht will be approximately 
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the product v 8/ of the velocity and the length of time, 8/. In a fimte 
interval, from to ^2. the distance moved will therefore be Jv dt. 

11.8 Areas of loops 

The areas considered in previous sections have been of a very special 
form, with three straight sides and one curved one. However, it is ^ 
easy matter to obtain the area of any curved figure. Take first a simp^ 
loop ABCDA (Fig, 11.9) with left-hand end A and right-hand end C. 



U V K M L N 


Fig. XX.9—The area of a loop 


Draw perpendiculars AU, CV on to the x~axis. The area AUVCDA 
can then be evaluated by integrating jy with respect to x along the upper 
part of the loop 

area AUVCDA = f y dx. 

J AX>C-' 

The area AUVCBA can similarly be found by integratingy along the 
lower part of the loop. But the area of the loop is simply the difference, 
area A VVCDA - area A UVCBA. 

A more complicated figure such as EFGHIJE in Fig. 11.9 can be 
dealt with by a similar procedure. Here the area of the loop = area 
EKNIJE — area EKLFE + area GMLFG — area GMNIHG, and 
each of these areas can be calculated by integration. 



Fig. 11.10—The area of a segment of a parabola 
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EXAMPLE 

(i) Find the area AOB between the parabola y = and the line 
y = \ (Fig. II.lo). 

A and B, the intersections of the parabola and the line, are the 
points (— I, i) and (i, i) respectively. Drawing perpendiculars AU, 
BV onto the rv-axis we have therefore 

area AUVBCA = i dx = z 

area AUVBOA = .v* dx = = f 

area of loop =2 — f = I- 

11.9 Areas divided into curved strips 

So far we have considered areas as divided into a large number of 
thin straight strips, and thereby in the limit expressed as an integral. 
The calculations can often be simplified by using curved rather than 
straight strips. 

The basis of this method is that a strip of length L and uniform 
thickness w, where to is small, has an area very nearly equal to loL 
(Fig. II.ii). For imagine this strip cut into small segments of length 



SL by lines perpendicular to the strip. Each of these segments will be 
nearly rectangular in form, and will therefore have area length x 
breadth = whL. By the addition of all these small segments we find 
the total area to be EwhL = wZhL (since w is constant) = wL, 


EXAMPLES 

(1) The area of a circle ot radius R (Fig. 11.12). 
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Let US imagine the circle divided up into a number of rings by 
concentric circles of radius r varying from o to R. Take any one such 
ring, say of radius r and width Sr. Then its area SA will be its length 
27 rr (Section 6.12) times its width 8 r; SA — Zvr Sr. The area of the 
whole circle will be approximately the sum of the areas of all these 
rings, or USA = Z' 27 rr 8r; and the narrower the rings the more accurate 
this formula will be. Thus in the limit the area will be exactly 

zttt dr ~ = ttR- . , (ii,i8) 

(2) The area of a sector of a circle bounded by two radii making 
an angle 6 with one another (Fig. 11.13). 



Fig. 11.13—The area of a sector of a circle 

Again the sector is to be divided into strips of uniform width between 
the arcs of concentric circles. If 6 is measured in radians then the 
length of any such strip of radius r will be rd, and if its width is Sr its 
area will be approximately SA = rO Sr. The total area of the sector 
will accordingly be obtained approximately by summing these strips 
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to find ErO Sr; or in the limit when the width Sr tends to o it will be 
exactly expressed as 

So far we have considered only plane areas. This method of cutting 
the area up into strips is equally applicable to curved surfaces. 

FURTHER EXAMPLES 

(3) The area of the curved surface of a (circular) cylinder of length 
L and radius R (Fig. 11.14). 



We shall imagine the base of the cylinder to be horizontal, and the 
axis vertical, as in the figure. Let the surface be divided into thin 
circular strips by cutting it by a large number of horizontal planes. 
Call the height (above the base) of a typical cutting plane “a:”: then the 
width of such a strip is the change in x between the bottom and top 
of the strip; i.e. it is 5 .v. 'I'hc length of the strip is the circumference 
ItiR of the cylinder, and its area is therefore approximately ZttR 8x. 
The area of the whole curved surface is approximately E znR 8x, or 

exactly znR clx, since x varies from o at the base to L at the top. 

But ZttR dx = [^TrRxf^ = ZttRL. 

A similar formula holds for the general cylinder or prism, defined 
as the volume swept out by any plane figure moved a distance L per¬ 
pendicular to itself. Such a triangular prism is shown in Fig. 11.14; 
again we imagine the base to be horizontal. If the circumference of 
the base is C, the area of the vertical surface is CL. For we again cut 
the surface into strips of width 8 x by planes parallel to the base. The 
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area of such a strip is approximately CS.r, and the area of the whole 

is Z C 8 x, or exactly C dx in the limiting case when the number 

of strips increases indefinitely, and the greatest width tends to zero. 

But Cdx= [Cx]^ = CL. One can in fact take a rectangle of 

paper or similar material of sides C and L and wrap it exactly roimd 
the cylinder or prism, thus showing in another way that its area must 

be CL. , . 1 - • j u 

If it is the whole surface area of the cylinder which is required, then 

the areas of the two flat ends must be included. For a circular cylinder 

each end vAW have area ttR^ (Example i), and the total area will be 

27 tRL -h 2ttR- — 2ttR{R + L). 

(4) The area of the cur\'ed surface of a (circular) cone with base 
of radius R and height H (Fig. 11.15). 



\Vc again imagine the base of the cone to be horizontal, and the 
axis vertical, as in the figure. If L is the “slant height” of the cone, 
i.e. the distance from vertex to base measured on the surface, then by 
Pythagoras’s 'Pheorem 

L = + m 

Again let the surface be divided into thin horizontal strips, and let 
.V be the distance of a typical strip from the vertex, measured down 
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the sloping surface of the cone. Then the width of the strip will be 
the change in the value of x between its two edges, i.e. Sx. The len^h 
of the strip will be 27 rr, where r is its radius. But by simple proportion 
(see Fig. 11.15) rjx = R}L, i.e. r = xRjL, and the length of the strip 
is therefore ^ttxRJL, and its area (approximately) 27 txR BxfL. Adding 
all these strips together we find the total curv^ed surface area to be 
approximately Z" zirxRhxjLy or, in the limit, exactly 

zttxR dxjL = [-nx^RjlSf' 

Jo 0 

= ttRL 
= TrRViR^^ 

For the total surface area of the cone the base must also be included, 

giving the final formula ttR [i? + V{R~ + ^^)]- 

An alternative approach is to cut out in paper a sector of a circle 
of radius L and arc zttR. This sector can be bent to fit exactly over 
the slanting surface of the cone. Now if 6 is the angle in this sector, 
measured in radians, then the arc is Ld and the area is \L^6. Thus 
Le — 27 tR, whence 6 = zttRIL, and the slanting area = ^L\2nRjL)= 
ttRJL. 

(5) Area of a sphere of radius R (Fig. 11.16). 



Let O be the centre of the sphere. It is convenient to take a fixed 
point P on the sphere, which we may call the “north pole”, and cut 
the sphere into thin strips by “circles of latitude” surrounding P. Let 
e be the angle subtended at O (or “difference in latitude”) between P 
and such a strip. "I hen if the strip corresponds to a difference 80 in 0, 
its width will be RhO (see Fig. 11.16). Furthermore its radius will 
be R sin 0 , and therefore the length of the strip will be 2ttR sin 0 , and 
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its area {zttR sin 6) (RhO) = zttR^ sin 6 BO. Now 6 varies from o at 
the north pole P to tt (= 180°) at the south pole and so we see that 
the total area obtained by adding all strips is 

zttR- sin $ dd = [— zttR- cos oy 

= [- ZnR%- I - I)] 

= 47ri?®. 


PROBLEMS 

(1) Show that the area of a parallelogram is the product of the 
length of the base times the height (measured at right angles to the 
base). 

(2) Show that the area of a triangle is half the product of the length 
of the base and the height. 

(3) Find the area of the segment of a circle of radius R cut off by 
a straight line at distance a from the centre. 

(4) A goat is tethered in a circular field of radius by a chain 
of length L attached to a point on the circumference of the field. Find 
the area over which the goat can graze. Taking R — ly plot a graph 
of this for different lengths L of the chain. Estimate the length L for 
which the goat can graze over half the area of the field. 

(5) Find the area of a cylinder by taking strips parallel to the 
axis. 

(6) A cylindrical birthday cake of radius R and height H is cut 
by two plane cuts through the axis at angle 6 (radians) with one another. 
Find the surface area of the piece cut out. 

(7) A “frustum” of a cone is the part cut out by two planes per¬ 
pendicular to the axis. If the radii of the two ends are R^ and Ro and 
the thickness of the frustum is 7 ’, find the area of the sloping part. 

II. 10 Volumes 

Most volumes can be found by cutting up into thin slices of uniform 
thickness, adding together the volumes of the slices, and taking the 
limit as the number of slices increases indefinitely and their thickness 
tends to o. This gives a definite integral. 

The volume of a thin slice based on a flat or cur\'ed area A and of 
thickness w is approximately wA. This is almost obvious: but it can 
be proved by cutting the slice into small nearly rectangular parts 
(Fig. II.I/)- (The parts near the edge may give irregularly shaped 
parts, but these will have only a small total volume, as is practically 
obvious from the diagram, and could be rigorously proved under 
suitable conditions by a more complicated argument.) Now each of 
these parts, if we take the thickness of the slice into account, is a nearly 
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Fig. II. 17—The volume of a thin slice 


rectangular block: its volume is therefore the product of its thickness, 
breadth, and length, i.e. the thickness w times the area of its upper 
face. If we add all these blocks together we get a total volume ^ w 
times the area A. of the face. The smaller w is, the nearer this formula 
will be to exact truth. 


EXAMPLES 

(x) The volume of a cylinder of radius R and height H (Fig. 11.18). 



Fig. II. 18—The volume of a cylinder or prism 


Again consider the cylinder cut into thin strips by planes parallel 
to the base. If a typical strip is at height x, and of thickness Sx, its 
volume is the product of Sx times the area of the base, = ttR^ bx. 'I’he 
total volume is therefore approximately 2* ttR^ 8x, or exactly 

ttR^ dx = rrR^H. 

J 0 
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A similar formula holds for the general cylinder or prism: its volume 
is equal to the product of the area -A of the base times the height H. 
This is true even if the cylinder or prism is “oblique”, i.e. if it is the 
volume swept out by the area A moved parallel to itself (Fig. 11.18), 
provided that the height H is measured between the two end faces 
and perpendicular to the base. The formula for the area of the sloping 
surface of an oblique cylinder can be shown to be the height H times 
the circumference of the base. The simplest method of showing this 
is to cut it into strips parallel to the axis: the details are left to the 
reader. 

(2) The volume of a cone of radius R and height H (Fig. 11.19)* 



We again imagine the cone cut into thin slices by planes parallel 
to the base. If the vertical depth of any slice below the vertex is y, 
then its thickness is the change in y between its upper and lower faces, 
i.e. hy. The radius r of the slice is given by rjy = R/H, i.e. r = yRjH. 
The volume of the slice is the area ttt- of the upper face times the thick¬ 
ness Sy, i.e. TT^yRlH)- 8y. The total volume of the cone is therefore 
approximately E 7T{yRIHy By summed over all slices, or since y varies 
from o to H, it is accurately 

I/= /"ir^yRIHY-dy 

= ^RIHY y^-dy 

= n(RIH)^liy^« 

= \^(RIHfH^ = inR^H. 
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A similar argument applies to any cone or pyramid, including an 
oblique cone or pyramid (Fig. 11.19). Such a cone is defined as the 
volume enclosed between a plane base of area A and all the straight 
lines joining points on the circumference of ^ to a fixed point P called 
the “vertex” of the cone. If the perpendicular distance of P from the 
plane containing the base A is H, then the volume of the cone is ^AH. 
(Its area is much more difficult to obtain.) 

(3) The volume of a sphere of radius R (Fig. 11.20). 



Fig. I Z.20—Construction for determining the volume of a sphere 


Imagine the sphere cut into concentric spherical slices. Let us take 
any one such slice, of inner radius r: then its thickness is the change 
in r between the inner and outer surfaces, i.e. Sr. The area of the inner 
face is 47ir*, by the formula for the area of a sphere. I he volume of 
the slice is accordingly approximately 47rr'^5r, and by summing and 
taking the limit in the usual way we find for the volume of the sphere 


the integral 


4Trr- dr = 

y 0 ® 


PROBLEM 

(i) Show that the volume of a “parallelopipedon”—i.e. a box with 
three pairs of opposite parallel faces, as in Fig. 11.21, but not necessarily 



Fig. ZI.21 —A parallelopipedon 
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rectangular—is the product of the area of the base and the perpendicular 
distance between the base and top surface. 

II.II Length 

The length of a curve is calculated by cutting it up into small 
portions, or arcs, finding the approximate length of each, and summing 
these. The smaller the arcs are into which the curve is cut the more 
nearly their approximate sum will approach the true length of the 
curve: and in the limit the length will be expressed as an integral. 

Consider first a plane curve, C, which we shall take as the graph of 
a function y plotted against x (Fig. 11.22). Take two neighbouring 



Fig. X1.22 —Calculation of the length of a plane curve C 

points P and Q on this curve: let P have co-ordinates (:r, y) and Q 
(.V + S.v, y + Sy). If P and Q are sufficiently near together then the 
arc PQ, of length hL say, is ver}' nearly a straight line. Its length is 
therefore given by Pj'thagoras’s theorem (see Fig. 11.22), 

{hLf {hxY + {hyf 

= (S:v)= . [i + ( 5 y/S.v)=] 
i.e. hL ^ S.v . y/[i + ( 5 y/S.v) 2 ] 

Now when P and Q are near together the quotient 8y/8.v is very 
nearly equal to the derivative Dxy\ or Vx as we shall write it for the 
sake of brevity. Thus SZ-c- S.vv/[i +yj:"]- 

The total length will be obtained by summing the lengths of all 
these small arcs. It is therefore approximately 27 8.v\/[i d-y*^]* of 

accurately J v [i -hy*^] • We have finally to fill in the limits of 

integration. If the arc begins at a point at which the v-co-ordinate is a, 
and ends with .v-co-ordinate b, then a and b are the required limits 
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(since we are integrating with respect to x), and the length of the whole 
curve is 

L = [^ V[i + y^~] • 

J a 


EXAMPLES 

(i) Find the length of the arc of the parabola jy = from .v = o to 
a; = I- Here jy* = 201:, so that 

^ /o 

= 2 v[i + ^‘1 

= 2 . (i/2) [sinh-* 2.x -h {xj i) \/(i + -x-)]/, 

= i [sinh"^ 2 + v^2o] 

(2) Find the length of the catenary y = cosh .v from .v = o to 
X = 1. 

Here jy* = sinh x, so that 

L = V[i + 

J 0 

= cosh .V . dx 

J 0 

= [sinh x]^ = sinh i. 

If we have a curve in space, and if the points are 

specified by 3 cartesian co-ordinates (.v, y, z), then the length can 

imilarly be shown to be the integral L = y*^ \/[i + yx~ + dx, 


where x = n zt the beginning of the curve and x — 6 at the end. 


11.12 Surfaces of revolution 

The sphere, circular (right) cylinder, and circular (right) cone are 
all examples of “surfaces of revolution”, the sort of surfaces which 
can be produced by a lathe. We can look on a sphere as produced by 
a semicircle rotated round its diameter, which is held fixed. Similarly' 
a cylinder can be produced by rotating a rectangle round one side, 
and a cone by rotating a right-angled triangle round one side (not the 
one opposite the right angle). The general surface of revolution will 
be obtained by rotating a plane curve C around the A:-axis, which will 
become the central axis of the surface. 

There are general formulas for the volume and area of such a surface. 
Let r be the radius of a section at a distance x along the axis (Fig. 11.23): 
the section, taken perpendicular to the axis, will be circular, and of 
area ttt^. If we take a small slice of thickness 8.x lying bctweci^ two 
such sections, its volume will be approximately Trr^Bx. 


V 
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Fig. XI.23—A surface of revolution 


It follows that the total volume contained within the surface between 
the planes a and x^h will be the limit of the sum 

Sx, i.e. it will be nr-dx. 

^I'he two parallel-section planes at distance 8 a; apart will also cut 
off a small strip of the surface. The length of this strip will be the 
circumference of the section, i.e. 27 rr; let us call its width 8L. If we 
take a section of the surface by a plane through the axis, then we obtain 
a curve, C. It is in fact this curve which we imagine rotated round the 
axis to form the surface of revolution. The width 8L of the strip is 
then equal to the length of the small arc of the curve cut off between 
.V and .V + S.v; and, just as in the case of the method of finding the 
length of a curve (Section ii.ii) we must have 

( 5 L) 2 -(S.v )2 

where 5 r is the change in the radius r when .v increases by Bx. Written 
in another form, 

V[(5.v)’ +(Sr)=] 

= V'[(5-V)= {I -h ( 5 r/S.v)n] 

— S.v . \/[i + ( 5 r/ 8 A;)“] 

S.v ^/[I -i- r/] 

where is the derivative D^r -= drjdx. It follows that the area of the 
strip is approximately znr hL =• Zttt v[i + ^x“] By adding all 

these strips together we find the total area of the surface to be approxi¬ 
mately T 27 Tr v[i -r rj^] . S.v, or in the limit exactly 

27 T /’*' r ^■"[I + r^-] . dx 

J a 

This formula gives the area of the curved surface between the planes 
^ and -v = b, and does not include the area of the two circular 

ends. 
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EXAMPLES 

(i) The area and volume of a cylinder calculated from the general 
formulas. 

A cylinder has constant radius r = R for all values of x: thus 
Tj. = o. Its length is 6 — a = H. By our formulas 

Volume V = /"** irr^dx 

J a 

= TrR'^dx 

J a 

= [7tR^x]1 = 7tR% — TtR-^G 
= 7TR\b — a) = TrRm. 


Surface area A (excluding ends) 

= /■** 277 r V[i + dx 

J a 

= /’*’ 27 TRdx 
J a 

= [27r7?.v]y = ZrrR (6 — — ZttRH. 

(2) The area and volume of a spheroid. 

A “spheroid” is the body formed by rotating an ellipse about one 
of its axes. If the ellipse is rotated about the minor (shorter) axis the 
figure obtained is an “oblate” spheroid, similar to the shape of the 
earth, which is flattened at the poles. If the axis of rotation is the 
major (longer) axis we obtain a “prolate” spheroid. Since an ellipse 
can be regarded as a circle squashed or pulled out in the direction of 
an axis, a spheroid will be the figure got by contracting or expanding 
a sphere in a given direction in a given ratio. 

Suppose therefore that the axis about which the ellipse is rotated 
has length za, and the perpendicular axis has length zb. 'I'he relation 
between x and r will then be given by the usual equation for an 
ellipse (5.28) 

x^la^ + r^jb^ = I. 

Solving this we have 

r2 = b\i — xVo") 
or r == by/{i — x^ja^) 

On differentiating this with respect to x we find 

r* = — ba-^xjy/{i — x'^ja^) 
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The values of x will range from —a to a, and therefore the volume is 

v= r 

J —a 

= 7 r/‘“ b^-{i ~x^la^)dx 

J —a 

= ^6* [* - ix^la^y. a 

= ■nb'^ [{a - ia>la^) a + JaV)] 

= [§a - (- ia)] 

= iirab^ 


The area is 


A = 


2.Tr J r H- r/) dx 

/•“ iL a/ -2—/ _i_ 

= 27r / 6 Vi — I H- 5 —; 

J -a N 1 — x^a~ 


dx 


= Z‘nb f ■\/{i — x^a”^ -f h^x^a~*) 

J —a 


-^'Sdx 


— ZTrb f Vi — x^a~\a^ — b“) dx 

J —a 




dx 


Now in this formula we have ttvo possibilities, since we have not 
stated which axis is longer, the one about which the ellipse is being 
rotated (of length 2a) or the perpendicular axis (of length 26). First 
suppose a > by i.e. the spheroid is prolate (a sphere pulled out). 
Then a~ — b- is positive. We can simplify the formula by writing 

= Hy so that the integral for the area becomes a 



a~\a^ — b^) 
standard form (p. 233) 

27 ib 


A = -jf f_^V{H-^-x-)dx 


27 Tb rnn r • i -v, 


xV{H-^ - x^)Y 


= 27TbH 


[sin- I 


+ 


W 


1 


1 


On the other hand if a < 6 we have an oblate or flattened spheroid. 

^K, 


In that case (6- — a®) is positive, and we can write ^ 


(6® - a®) 
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The integral then simplifies to the standard form 

A = + X-) dx 

2nb r ■ u 1 ^ . ^ViK 




[ 


a 


= 27TbK I sinh-i ^ 4- 

K. 


aV{K^ + a^) 


1 


1: 


PROBLEMS 

(1) Find the area and volume of a cone, using the general formulas. 

(2) A wooden ring is made by boring a cylindrical hole through a 
sphere, the axis of the cylinder passing through the centre of the 
sphere. If the length of the hole (measured parallel to the axis) is L, 
what is the volume of the wood in the ring? 

1Z.13 Summary of principal results 

(All integrals have limits of integration Xx and x^.) 


Lengths 

Circumference of circle = 2ir (radius). 

Arc of circle = (radius) (angle at centre in radians). 

General plane curve = ■y/{i 4 -j>'*^) dx. 

General curve in space = J \/{i + 4 - dx. 

Areas 

Rectangle or parallelogram = (base) (perpendicular height). 
Triangle = \ (base) (perpendicular height). 

Circle == tt (radius)®. 

Sector of circle = \ (radius)® (angle at centre in radians). 

Kllipsc = 7 T (semi-major axis) (semi-minor axis). 

Circular cylinder (excluding end faces) = 27 r (radius) (perpendicular 
height). 

General cylinder or prism (excluding end faces) = (circumference 
of base) (perpendicular Height). 

Right circular cone (excluding base) = tt (radius of base) (slant 
height). 

Sphere = (radius)®. 

Area under general plane curve = J y dx. 

Area of surface of revolution (excluding end faces) 

= 27 T J r y/{\ 4 - dx. 
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Volumes 

Box or parallelopipedon or cylinder or prism 
(area of base) (perpendicular height). 

Circular cylinder = tt (radius)2(perpendicular height). 

Cone or pyramid = ^ (area of base) (perpendicular height). 
Circular cone = I rr (radius)2(perpendicular height). 

Sphere = | -n- (radius)®. 

Spheroid = h tt (axis length) (perpendicular diameter) . 
Surface of revolution = tt J dx. 


EXAMPLES 


(i) Assuming metabolism to be proportional to amount of surface 
area, compare the rate of metabolism in man, whose surface area is 
about 260 square centimetres per kilo of body weight, with that of a 
cylindrical bacillus coli, whose size is X 2^, assuming its specific 
gravity to be i, and /x = lO"* cm. 


Volume of bacillus = 



Weight of bacillus 


v X io~*® 
2 

TT X 

2 


CC 


gram 


Surface of bacillus = 27 rr(r + ^0 = 



_ 577 X 10 
2 


-8 


sq cm 


/. Surface per gram of bacillus 

and surface per kilogram of bacillus 

Surface per kilo of bacillus 
Surface per kilo of man 


~ = 5X 10* sq cm 

TT X 10"^® 

= 5 X lo* sq cm 

= 5 __X ^ about 2 X 10® 

260 


Hence, metabolism in bacteria is about 200,000 times as quick as 
in man. It is on account of this enormous rate of metabolism and 
absorption of food material that bacterial gro^^'th is so rapid—division 
taking place at the rate of about once in half an hour. 


(2) It has been found that the average diameter of an adult’s 
pulmonar\’ air-cell -= 0-2 mm, whilst that of an infant’s air-cell (at 
birth) ^ 0-07 mm. Assuming that these air-cells are spherical, and that 
the total volume of the lungs ^ 1617 cc in the adult, and 6 'j-y cc in 
the new-born infant, find the total number of air-cells and their total 
surface in the adult and in the new-born infant. 
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Volume of single air-cell in adult 
Volume of single air-cell in new-born 


total number of air-cells in adult 


= ^7r(o*i)® cu mm 
= 0 004 cu mm 
= 377(0*035)® cu mm 
= 0 00018 cu mm 


1617 X 10® 
0*004 


= 404 X 



And total number of air-cells in 
new-born 


67*7 X 10® 
0*00018 


= 376 X 



i.e. the number is approximately the same at birth as in the full-grou-n 
adult, viz. about 4 x 10®. 

Surface of single air-cell in adult = 477(0*1)® = 0*125 sq mm 
Surface of single air-cell in new-born = 477(0*035)® = 0*0154 sq mm 

/. Total surface of air-cells in adult = 4 x lo® X 0*125 sq mm 

= 50 sq metres 

And total surface of air-cells in = 0 0154 X 4 X 10® sq mm 
new-born = 6 sq metres. 

Total surface of air-cells in new-born is about i that in the adult. 


Hence we see that whilst the volume of the infant’s lungs is only 
about 2V that in the adult, the total surface of the alveoli is as much as 
I that in the adult —showing that the gaseous exchange is more active 
in young infants, i.e. about three times as active as in the adult. More¬ 
over, as the area of the infant’s skin surface is J that in the adult, it 
is seen that the amount of gaseous interchange per unit of body surface 
is the same in the infant as in the adult. (See W. M. Feldman, Princi¬ 
ples of Ante-Natal and Post-Natal Child Physiology, Longmans, 1920, 
and Principles of Ante-Natal and Post-Natal Child Hygiene, John Bale, 
Sons & Danielsson, 1927.) 


PROBLEMS 

(1) The radius of each particle of cholesterol in a sol containing 
0*0005 gram of that substance per cc is lO”® cm. Find the total surface 
area of these particles. (Assume sp. gr. = i.) 

(2) The average diameter of a human capillary is mm; the 
linear velocity of blood in it is ^ mm per second. 

Find the volume of outflow from a capillary per second. 

(3) Experiments on animals have shown that the circulation time 
is equal to 28 heart-beats. Assuming this to hold good for man, and 
also assuming the total volume of blood in the body to be 4000 cc, find 
the number of capillaries in the human body, using the data of the last 
example, and assuming the pulse rate to be 72 per minute. 
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(4) The following has been found to be the percentage composition 
of ordinar}' bacteria: water, 85 per cent; solids, 15 per cent, of which 
I part in a thousand consists of sulphur. 

Assuming that the weight of an atom of any element = A X 8*6 X 
mgm, Avhere A = atomic weight of the element, how many atoms 
of sulphur does a micrococcus of diameter 0*15^1 (fi = ttsW ttim) 
contain ? 


II. 14 Areas with signs 

In Section 11.3 it was shown that the integral f ^ f(x)dx could 

J Xi 

be interpreted under certain conditions as the area under the curve 
y = f(x) between the ordinates x^ and x^- These conditions were 
(i) Xi < .Vo, (ii)/(A') is never negative for values of x between Vi and 
.Vo. We shall now investigate what happens when these conditions are 
relaxed. 

First we shall keep the condition .Vi < .Vo, but allow f(x) to become 
negative. 

If /(.y) is negative for all values of .v between .Vj and Vg, the integral 
/*’’ /(•^) must also be negative. This follows directly from the 

definition of the integral. Let /(.v) be the indefinite integral J /(v) rfv= 

J y dx\ then /(.v) dx means /(vg) — /(^Vi). But Dx I{x) = Ix{x) 

= /(.v), by the definition of an indefinite integral, and so I{x) has a 
negative rate of change /(.v). 'Fhis means that /(v) decreases as x 
increases, and in particular as v increases from Vj to x^ I(x) must 
decrease from /(.v,) to /(.Vj), and the difference /(.Vi) — -^(^*2) must be 
negative. 

'I'his will equally apply to other interpretations of the integral. If 
/ stands for the time, and v the velocity of a moving body, then 


J*' T dt means the distance gone, or change in position, between 

the times /, aiul tn. If v is always negative the body will always be 
moving backwards instead of fonvards. It will lose ground instead of 
gaining it, and the total distance covered will accordingly be counted 
as negative. Again if F is a force acting on a body moving along a line, 


and .V is the co-ordinate of the bodv, then / F dx represents the 

work done by the force in moving from position to .Vg. If F is 
negative the body is ahvavs moving against the force, and it has to do 
work instead of having work done on it, i.e. the work done will be 
cf)untcd as negative. 

Finallv, we can still regard the integral as the limit of a sum. If 
the inten-nl from v, to .V2 is divided into ?i strips by numbers X-^ = Vj, 
A'2, A'2, . . . A'„ ,, .v.,. with corresponding y values I'l, + 
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then the integral will still be the limit of S = 2" Fa (-Va + i — Xa) = 
Z YhX as the number of strips tends to 00 and their maximum width 
to zero. If y is always negative the sum S will be negative, since each 
term YhX is negative: and therefore the limit of S must be negative, 
or at any rate not positive. 

Consider .what this means in terms of areas. Draw the cun-'e 
y =y(x): this must now lie entirely below the x-axis. Let P be the 
first point on the curve, and Q the last, (xg, js) (Fig. 11.24). 



Fig. 11.24—A negative area above a curve 

Draw perpendiculars PU, QV onto the .v-axis. Divide the area UPQV 
into n strips by ordinates at the points A',, X.^, X■■^ . . . A’n + i* (In the 
figure ti = 4.) Then Y^hXi = FifA'a — X^) is minus the area of a 
rectangle approximately equal to the first strip, since V', is negative. 
Similarly Y^^X2 is approximately minus the area of the second strip, 
and so on: and the sum .S’ = • Z YhX approximates to minus the area 

of the whole figure UPQV. In the limit we sec that y dx — 

— area of UPQV = the area between the curve and the x-axis, but with 
a negative sign. 

From this \vc can readily see what the interpretation will be when 
y is sometimes positive and sometimes negative. Suppose, for example, 
that the curv'c y = f{x) starts above the x-axis at P, crosses it at W, 
and ends below it at Q (Fig. 11.25). Fhen the loop PUW will be 

counted as positive and the loop WVQ as negative and y dx — 

area PUW — area WVQ. 
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Fig. XX.25 —The integral of a function which changes sign 

It is clear that we need some general way of giving a sign to an 
area. How we are to do this is not immediately obvious, since an area, 
as usually understood, is essentially positive. But a consideration of 
the cases of length and angle may be helpful. A length or distance, 
considered on its own, is essentially a positive number. But when 
lengths are measured along the .v»axis of co-ordinates we speak of some 
as positive and others as negative. We do this by agreeing that a 
particular direction along the axis shall be counted as positive, namely 
from left to right; and we then say that the distance PQ is positive if 
Q is to the right of P, and negative if Q is to the left. Why do we do 
this? For three main reasons: firstly the distinction is important. 
In most cases it is essential to know not only that a point has moved 
such a distance, say 10 cm, but also which direction it has moved in: 
the consequences of motion one way may be very different from those 
the other way. Secondly, the sign convention simplifies calculations. 
If P, Q, R are three points on a line, then the relation PQ -h QR = PR 
holds whenever Q lies between P and R. If we count all distances as 
positive it will no longer be true in other circumstances, but will be 
replaced by some other relation such as PQ — QR ~ PR. But if 
distances are given their proper signs then we can write PQ 4 - QR 
= PR without exception. The third advantage is that the convention 
of signs agrees with that needed for other branches of mathematics, 
such as trigonometry' and differential calculus. 

A similar situation holds for angles in a plane. We agree to con¬ 
sider /_AOB as positive if the rotation from OA to OB is anti-clockwse. 
Again it always holds that /_AOB + /_BOC = /_AOC when the 
proper signs are used. 

Areas lying in a plane can also be provided with signs. Most areas 
are contained within some form of loop, and we can imagine this loop 
to be traced out by a moving point P (Fig. 11.26). If P describes the 
loop in an anti-clockwise direction we say that the area A contained within 
it is positive. If P goes round in a clockwise direction then the area 
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is negative, as with area B. If the loop crosses itself then those areas 
round which the loop runs anti-clockwise are positive (such as F) and 
those contained in a clockwise loop (such as A) are negative. This 
convention agrees with the correct sign for an integral, provided that 
we take the area under the arc PQ to be that contained in the loop 
PUVQP, described in that direction (Fig. ii.i, 11.24, In 

Fig. II.I the loop is anti-clockwise and the integral is positive, in 
Fig. 11.24 clockwise and the integral negative, and in Fig. 11.25 
the loop splits into two, with corresponding signs. 

With this convention it also follows that if the loops containing 
two areas have a part KL in common, but described in opposite 
directions, then we can always write area JKLJ ^ area MLKM 
ZTQ'dJKMLJ, using the appropriate signs (Fig. 11.26). 

Sometimes an area may be contained more than once within a 
loop. For example, in the case of the areas E and Z in the figure we 
see that E is contained once within the loop, but Z is surrounded twice 
in consequence of the way the loop crosses itself. Or we may look at 
it another way. We can start from the point /, go once round the outer 
part of the loop returning to /, and enclosing both areas E and Z in 
an anti-clockwise direction: we can then go round the inner part of 
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the loop, enclosing only Z, Either viewpoint suggests that the proper 
way of counting the area enclosed by the loop, with our convention, 
is to take the area of E plus twice the area of Z. This can be shown to 
agree with the law that area JKLJ + area MLKM = area JKMLJ\ 
it is simply the special case in which K. and L coincide with /. 

An area can also lie between two or more loops, as area H does in 
the figure. Here the proper course is again suggested by the figure. 
The outer loop, being anti-clockwise, encloses both H and 0 positively. 
The inner loop, being clockwise, encloses Q negatively. The total 
area enclosed in both loops will be, according to our convention, (area 
of -|- area of 0 ) — area of 0 = area of H only. 

We shall now consider what happens when we remove the restriction 

that X2 '> Xi in the definite integral y dx. We know that if 

lix) is the indefinite integral of v, then f** y dx is defined to be 
I{x^ — provided that x^ > .afi. It is natural to drop this restric¬ 

tion, and to say that y dx shall be defined as I{x^ — 
all values of .Vj and x^- We shall then have 



dx = /(jCi) — ^(^’^2) 


= - [/(*,) - /(*,)] 



(11.19) 


so that interchanging the limits of integration merely changes the sign 
of the integral. This agrees with our definition of the integral as the 
area PUVQ (Fig. ii.i) taken with the proper sign, for the integral 


f^' y dx with 

•d X • 


limits A'l and x^ 


interchanged will mean the area 


OVUP, i.e. the same area except that the containing loop is described 
in the opposite direction. It also agrees with our interpretation of 


J' ^ F dx as the work done by a force F in moving from position 
Xi to position .Vo; for f^' F dx is then the work done in moving 

J X. 


back from .Vo to .Vj, and must be equal in magnitude but opposite in 
sign. ^ 

We can extend this sign convention to cui^'ed surfaces, by fixing 
an arbitrary sense of rotation on the surface, and to volumes, by giving 
an appropriate sign to the surfaces which contain them. But to con¬ 
sider these matters in detail would take us too far off our course. The 
important point is the connection between the sign of an area and the 
sign of an integral. 
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II.15 Area of a polygon 

Using this idea of sign we can readily find a formula for the area 
of a polygon such as PQRSTP constructed by joining the points 
P> Q> R> S, T by straight lines (Fig. n.27). Let P have co-ordinates 



(*i»^x)> _y2),and so on up to T' = (-^6.>’6)- Draw perpendiculars 

PU, QVy RWy SH, TK on to tlie x-axis. Consider first the area 

PUVQ. This is y dx, where y represents the height of the 

point on the line PQ with x-co-ordinate x. But since PQ is a straight 
line, it will have an equation of the form y = Ai -f B^x, where 
and are certain constants. 'I’hey can be determined from the fact 
that P and Q lie on this line, so that 


= Ay + ByXy I 
y-i Ay -H ByX^ ) • 


(i 1.20) 


Now the area of PUVQ, with the proper sign, is 
ydx^ {Ay 4 Byx) dx 

= [a]x + \ByX%^, 

= A y{x2— Xy) 4 

= \Ay{X2 — ^1) 4 - \X 2 {Ay 4 - ByX^) — hXy{Ay 4 ’ ByXy) 
= \{xzyi — Xyy^) -h i ^ 2>'2 — 

using equations (11.20). 

In the same way 

area QVWR = hix^yz — x^ya) 4 - \x^^ — ^x^y^ 
area RWHS = lix^y^ — ^^>'4) 4 - hx^^ — ^x^y^ 

and 80 on. 
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Now in Fig. 11.27 tbe areas PUVQy QVWR, RJVHS are positive, 
and the areas SHKT, TKUP are negative. If therefore we add all 
these areas together we shall obtain the area of the loop PQRSTP 
with its sign changed: all the other parts cancel out. The reader can 
satisfy himself by drawing polygons of other shapes that this is a general 
result. Therefore 


area PQRSTP = — area PUVQ — area QVWR — area RWHS 

— area SHKT - area TKUP 


and on substituting the expressions we have already obtained, and 
cancelling out terms with opposite signs, we finally obtain: 

area PQRSTP = 

= lxy{y2—ys) + ix^iyz —yi) + i^sCy 4 —>'2) ^ 

+ ^^4Cy6 —iVa) + —y^) (11.21) 

A similar expression holds for a polygon with any number of 
vertices. If the polygon crosses itself at any point the area given by 
this formula will agree with the convention of signs explained above. 


EXAMPLE 

(i) Find the area of the triangle with vertices (4, 2), (3, 3), and 
(2, i). 

The area = U-i(y2 —y^) + 2(^-3 — .r,) 4 * hxziyi —yz) 

= I . 4 . (2) + ^ . 3 . (— i) + i ♦ 2 . (— i) 


PROBLEMS 

(1) Find by the formula the area of the square ^^^th vertices (i, o), 
(o, 1), (— I, oj, (o, — i). 

(2) Show that for the rectangle with vertices at (.Vj, yO, (xg, yi)» 
(.V2, Va), and (.V,, r2), the formula for the area reduces to the product 
(.V2 — -'hKy '— Vi), i.e. length X breadth. 

(3) Prove algebraically that if tliree points P, O, R lie on a straight 
line the area of the triangle PQR is zero. 

II. 16 Numerical integration 

Sometimes it happens that it is necessar)' to find the area of a 
figure—such as a section of a bone—which can be drawn, but not 
expressed by anv simple algebraic formula. Sometimes although a 
formula may be available it may have no simple integral. In either 
case we can resort to “numerical integration ”, that is, the use of 
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approximate methods to give a numerical value for the area or the 
integral. 

We already know one approximation to the definite integral or 

area J^ y dx, namely the sum S = Z y dx, obtained by cutting the 

area into vertical strips. For if the strips are made sufficiently narrow 
S will become very close in value to the integral. However, although 
this method is theoretically perfect, it is of little practical use as a very 
large number of strips are needed to give a reasonably accurate value. 

There are other formulas which give very much better results. The 
most suitable one for most purposes is “Simpson’s rule”, which com¬ 
bines simplicity with a high degree of accuracy. 

Suppose that the required integral is y*“ y dx, and that we are 

given a graph of y against (Fig. 11.28). The interval from x^ to 



must then be divided into an even number zn of equal intervals of 
width 8, so that 2w8 = x^ — .v,. (In the figure zti = 8.) Let the 

dividing points be X, = x„ X^. X^ - X^„^^ = x,. and let 

the corresponding values of y be Kj. • • • + 

Simpson’s rule (which we give without proof) states that 

> dx ~ area under the curve 

= + 4^2 + 2F3 + 4!^ + 2^6 + . . . 

+ 4 ^ 2 n + F’an + j] .... (11.22) 

or, in words, ^ the width of a strip times [first ordinate last ordinate 
+ twice the sum of the remaining odd ordinates + 4 times the swn of the 
even ordinates'^. 

EXAMPLE 

(i) Find the value of In 2 = x^^ dx. First let us divide the 

interval from 4 to 2 into 2 strips, so that the dividing points are 

= 1, X2 = h X3 = 2. The corresponding values of ^ = x~^ are 

^1 ^ ^2 = §» ^3 = and the width of the strip is 8 = ^. q’he 

area is therefore approximately 

i8 ( V'l -f 4F2 4- y^) = J [i 4 - j; 4 _ — -69444. 
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To obtain a more accurate answer we divide the area into 6 strips, 
with = I, ^2 = - 3 , X4 = I, up to = 2. The corres¬ 

ponding values o(y are = i, Ta = t» ^3 — t> 5^4 = ¥> 5^6 = t%» 
Tg = ri and T, = 1%, and the area is approximately J ^ . [Ti + 
4F2 -\-2Ys ^^Yi d-zTs +4^6 + -69350. As the true value 

of In 2 is *69315 to 5 figures, it is evident that this method gives a 
remarkably good result with very little trouble. 

Sometimes it is not possible to divide an area or integral con¬ 
veniently into an even number of strips in this manner. If the area is 
divided into an odd number of equal strips, then the area of the first 
three of these will be approximately fS(Y'i + 3^2 + 31^3 + ^4)* 
and the remaining area can be calculated by Simpson’s rule. If the 
values ofy, say Tj, Y., Y3, ... . are known for values Xj, X^, Xg . . . . 
of X, then we can always find an approximate value for the area, whether 
the X’s are equally spaced or not. The general procedure is to find 
the polynomial y = A + Bx -h Cx- -j- . . .. which passes through the 
points (Xj, Ti), (X3, Y2), (Xg, Fg), etc. The coefficients A, B, C, . . . 
can be found by the method explained in Section 3.8. We then take 
the curve y = A -i- Bx + C.v® -f- ... as representing the given curve 
for r to a sufficient degree of accuracy, and find the area under it by 
direct integration. In theory this method is more accurate than Simp¬ 
son’s rule, but the gain in accuracy is usually worth the extra trouble 
only if for some special reason it is inconvenient to divide the interval 
into an even number of equal strips. Some of these most accurate 
formulas are tabulated in a convenient form for strips of equal width 
by W. G. Bickley in Mathematical Gazette, 23 (1939), p. 352, and in 
Fisher & Yates, Statistical Tables for Biological, Agricultural, and 
Medical Research (4th edn., Oliver and Boyd, 1953). 

Simpson’s and similar rules apply to a smooth, continuous curve. 
They are not so accurate when the curve has sudden breaks, changes 
in direction, or a vertical tangent. When calculating the area of a 
smooth rounded loop we shall lose a certain accuracy on applying such 
a rule to the whole loop because there will be points on either side 
where the tangent is vertical (as at A in Fig. 11.29). This loss in 
accuracy can be overcome by cutting a vertical slice through the centre 
of the loop, dividing it into an even number of equal strips by ordinates 
at A’l, A’g, Xg . . . Xgn+i. If the vertical w'idths of the loop at these 
values of .v are IT„ W^, . . . IFjn+i respectively, then the area of the 
vertical slice is by Simpson’s rule 

J (horizontal width of strip) (IFi + 4IF2 + 2W3 +41^4 + ^zn + i) 

(11-23) 

The areas of the segments at the ends not included in the vertical 
slice can be calculated by dividing them into horizontal strips and 
applying Simpson’s rule. 
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FURTHER EXAMPLE 

(2) To find approximately the area of a circle of unit radius using 
this method (Fig. 11.29 

Take a vertical slice of width i centrally through the circle. Divide 
this slice into 2 strips by a vertical line through the centre. 'I'hen the 
vertical w’idths at the left-hand edge, centre and right-hand edge of the 
slice are IF, = ^2 = 2, = ^3 respectively, and the horizontal 

width of a strip is The area is therefore approximately i . A . (\/^ 4- 
4 X 2 + ■v/3) = (4 ■\/3)/3. We also divide the left-hand segment 

into two horizontal strips by a horizontal line through the centre of the 
circle, of length and (in order to apply Simpson’s rule) by two hori¬ 
zontal lines (which will have zero length) through the lowest and highest 

the strips is i\/3. 'Fhc area 
w this segment is therefore approximately J i \/3 . [o 4 x ^ + o] 

i \/ 3 - i I'e area of the right-hand segment is equal to this. 'I’he 
total area of the circle including the vertical slice and the two ends is 
th^efore i (4 + \/ 3 ) + iV 3 4 - kV 3 3'Of>5- fn fact the true area is 
^ 3 '* 42 . so that this simple estimate has less than 3 per cent error, 

aturally if we took a larger number of strips we should get a very 
much more accurate value. 

Since volumes and lengths can usually be expressed as integrals 
this gives us a convenient method of calculating them too. 
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Infinite integrals 


Imagine an interplanetary rocket being sent off directly away from 
the earth. As it moves away it will use up energy, partly because of 
atmospheric friction, and partly because of the attraction of gravity. 
Let us neglect the friction for the sake of argument. At the earth’s 
surface the acceleration due to gravity is gy and therefore if the mass of 
the rocket is M the pull of its weight is Mg. As the rocket moves 
outwards this decreases according to the inverse square law, so that 
at a distance r from the earth’s centre the pull will be MgR^/r^, where 
R is the radius of the earth. The work done in moving outwards from 
the earth’s surface to a distance will therefore be 


W = MgR^jr^ dr 

= [- MgR^lrY^ 

— MgR-{ijR — ilr2) . . . (11.24) 


Now we know that as rg increases i/rj diminishes, and tends towards 
zero when rz tends to infinity. This means that when Tz is very large, 
i.e. the rocket is a great distance from the earth, the work W will be 
very nearly equal to MgR'^jR = MgRy which is a finite quantity. The 
rocket only needs to use up an amount of energy MgR to get as far 
away from the earth as it chooses. [If we assume that M — 10,000 kg, 
g = 9 8 metres/sec^ and R — 6,500,000 metres, we see that such a 
rocket would need about 6 4 X 10“ joules to escape from the earth.] 


Thus the integral W = MgR~jr^ dr tends to a finite limit 
MgR as rg tends to infinity. It is natural to express this as 



MgR^jr^- dr. 


This expression is known as an “infinite integral”, and is to be under¬ 
stood as the limit of the finite or ordinary integral MgR^jr"^ dr 
in the above sense. 

Another example of an infinite integral is the amount of work done 
by an electric charge in moving away from a fixed charge Here 
the law of force is again the inverse square law, F — e^ezl^o t-\ as the 

distance increases from to rg the work done is F dr 

r 

fdr = [r^^ — As r^ 

increases indefinitely this tends to the finite limit F dr — 

Another example would be the decay of a radioactive element. The 
radiation R from this will tend to decay according to an exponential 
law: R = ce~^\ where t is the time and c and K are constants. The 
total radiation given off bet\veen times and will therefore be 
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f dt = [— cK-^ = c{e-^*' — e-^‘-)IK. As tends 

to infinity this remains finite, tending to the limit R dt = 

ce-^*'IK which accordingly is the total amount of radiation emitted 
in the whole of time after the instant tj. 

We can also have integrals such as y dx, meaning 

J —e© 

I'm y dx, 

- — — 00 •' 


and y _^ y i/jf, meaning lim lim y dx. 

if . —►—00 •*-►00 ^ 


EXAMPLES 


(i) Find f 


dx 


i + x"^ 
dx 


We have , = [tan-' jcV'* 

= tan-'.Vj — tan“'.v,. 


Now as ATj —>• — 00, tan-' x^ (= — 90°), and as X2 —*• 00, 

tan-' X2 —> J-TT. Thus 

-» = i’' - (- ^ 

(This example shows how the number n, defined in the first instance 
as the ratio of the circumference of a circle to its diameter, can enter 

into calculations which at first sight have no connection whatever with 
circles.) 

(2) Find f** dx 

J —60 

y” e^dx = [e^Y^^ 

But as X, — 00, e*' o, so that in the limit dx — i. 

J —ee 


= = I — 


PROBLEM 

Find the following integrals: 

/, /I (3) secha:(/;e. 

xi*i8 Change of variable in a definite integral 

In Sections 10.2, 10.4 and 10.5 wc have found rules for the calcula¬ 
tion of indefinite integrals. These rules apply equally well to definite 
integrals. Consider, for example, the rule for multiplication by a 
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constant. Let J(:c) — J y dx be the indefinite integral of and 

J(x) = J ky dx the indefinite integral of ky. Then we know that 
J(x) = kl(x) -{- C, where C is an arbitrary constant. It follows that 


y'*’* kydx^ J{x^ — J{xi) (by definition) 

= mx,)-\-C]-[kI(x,)+C] 

= k[I{x,)~I{x,)] 

= k y dx. 

J «i 

Multiplication of y by a constant also multiplies its definite integral 
by the same constant. 

In the same way 

r“ {y Si) dx^ r'‘ydx zdx 

J X\ J it\ Xj 

(addition rule); 

if Z is any indefinite integral of a:, 

/*; dx = - /; dx 

(“integration by parts”). 

The rule for change of variable in an indefinite integral is 


dx 


fydx=fy- 


dz 


When vve come to put in the limits of integration we must remember 
that in the right-hand side we are integrating with respect to ar, not x, 
so that the limits for a will be Zi and z^ (say), the corresponding values 
of z when x = Xj and a: 2 respectively. 


dx 


r:yd=^=r:y ■ 


(”•25) 


There is also a further rule which applies only to definite integrals: 

y dx + y dx -f- y dx = y^*‘ y dx (i 1.26) 


i.e. the sum of the integrals over the ranges from x^ to a? 2» from x^ to 
.V3, from ^3 to .V4, is equal to the integral over the whole range from 
Xi to .^4. Interpreted in terms of areas it means that the area under a 
curve between (say) P, (A:j,yi) and Q, (.V2, jaX the area between 
Q and R, (.Vg, yg), plus the area between R and S {x^yy^) sums to the 
total area between P and S. It also follows from the definition of a 
definite integral: 

y dx + y*” y dx y dx 

= [llx,) - I{x,)]\ [l(x,) - /(X2)] + - lix,)-] 

== — A^i) = /*' y 
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(Although we have taken the case of three integrals for the sake of 
illustration, this is evidently a general rule.) 

But there is one point where some caution is necessary in applying 
these rules, especially the one for change of variable, and that is when 
we have many-valued functions. We know, for instance, that 

y (i + dx = tan-‘ x, and therefore (i + x-)-^ dx is by 

definition (tan“* X2 — tan“^ .v,). But tan~* x is a many-valued function; 
tan-i o can be o, or tt, or 2tt, or any multiple of it. On the other hand 

/a! ^ perfectly definite quantity, being the integral 

of a single-valued function, so that we are limited in the possible values 
we can take for tan-' x^ and tan-' x^ if we are to make their difference 
equal to the integral in question. The correct way to look at the 
problem is this. If I{x) is any indefinite integral of a function _>-, the 

definite integral y dx means the change in the value of I(x) when 

X changes in value from jc, to x^. We must imagine x as moving con¬ 
tinuously from Xi to .Vg, passing through all intermediate values on 
the way: if I{x) starts with some definite value /(.v,) it will then change 
continuously and end with some definite value /{x^). It is the difference 
between these two determinate values which equals the definite integral. 

For example, the integral / (i + dx = tan-> a; can be illustrated 

geometrically as follows. Let X be the point on the j:-axis with co¬ 
ordinate X, A!’, that with co-ordinate .v,, and that with co-ordinate 
Xz. Furthermore let O be the origin, and H the point (o, i) at unit 
distancevertically above O (Fig. 11.30). 'Fhen by definition l_OHX=e 


H 



Fig. 11.30—Determination of the value of tan- • x in an integral 


(measured with its proper sign) is one value of tan-* a*. Here we mean 
by ^ the angle lying between - - 90^ and = go'^ as luT 

gested by the diagram. Now so long as X moves continuously up and 
^wn the A-axis d must remain within these limits. In particular as 
X moves from ATj to X^, 0 = tan-* x goes from tan-* Aj /LOIdX 
to tan-* A2 = ZO//A2. both angles lying between — and Att and 
the integral = tan-* x,- tan- a, = ^OHX^-^OHX, = Z^X.HX, 

I his fixes the integral completely. ^ ^ 
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In the same way if we change the variable from x to Zy we must 
make sure that as x changes continuously from Xi to X2 so z changes 
continuously from Zi to Zz'y and it is as well to notice carefully how 
z changes. 


EXAMPLES 

(1) Evaluate fxcosx-dx. 

Here the natural substitution is to put z = x^y so that dzfdx = 2x, 
dxjdz = 1/2.V. Then, applying this to the indefinite integral, 

J X cos x^ dx j X cos x^ (dxjdz) dz 

= h f cos z dz 
= i sin z 

Now as .V increases continuously from i to 2, sr increases from = i 
to 2^ = 4. This is accordingly quite straightforward, and we can insert 
the limits of integration i and 4 for z. 

y^“ .V cos .V“ dx = [i sin s]* 

= i sin 4 — ^ sin I. 

(2) Evaluate ^ -v cos x^ dx. 

As before, putting .v- = z, we have J .v cos x- dx = \ J cos z dz 

- i [sin z]. But now as .v increases from — i to i, xr at first decreases 
from I to o and then increases again from o to i. If we want to play 
safe we can split the range of integration for ;; into two parts, firstly 
the part in which z decreases from i to o, and secondly the part in 
which z increases again from o to i, saying that 

y* .V cos -v- dx = y® i cos dz + y^ i cos sr dz 

= [i sin o — i sin i] + [J sin i — ^ sin o] 

— o. 

Actually in this example this is erring on the side of caution, since 
the integral h sin ;r is a single-valued function, and we shall be justified 
in a straightforward substitution 

y ^ .V cos x^ dx = [i sin z] [ 

i sin I — i sin 1 = 0. 

(3) Evaluate x V.\- — i dx. 

% ^ 

Here the obvious substitution is sr = .x- — 1, so that dx^'dz = 1/2.V. 
For the indefinite integral we have 
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/ AT Vx^ ~ I dx = f X Vx^ — ijzx . dz 

= i f \^z .dz 

= * . §^ 3/2 ^ 

Now when x = 2, 2: = 4, and when a: = 2, = 4, and by direct 

substitution we have 


xVx^-i dx = [ 5 ^ 3 /^]; = o, 

and this is wrong. For if we look more closely at the integral we see 
that as a: increases from — 2 to 2 it passes through a range of values 

for which a:^ — i is negative, so that — i does not exist. The 

‘‘integral” of x\^x"^ — i cannot therefore be evaluated between the 
limits — 2 and 2, and our apparent answer o is purely a fallacy. 

(4) Evaluate \/x~ -\-x^ dx. 

Here the obvious transformation is to take z = .x-. 

Thus for the indefinite integral we have 


f -y/x^ + x* dx = J 


Vx ^ + a: 


dz 




f Vjv* + a:< 

= - dz. 

J 2X 


Now since ^/{x^ + a:*) means by convention the positive square root 
we have 

If a; > o, v'(-v“ T x*)lx = y'(i + :v2) 

If a: < o. V{x^ + x*)lx = — -^(i 4. ^3). 

If a: = o we have an indeterminate result: but this will not affect the 
integral (just as a possible indeterminacy of one single point of a curve 
will not affect the area under the curve by any finite amount) It 

therefore follows that we must consider negative and positive values of 
X separately and write 

/-, v'*" +x'dx== /”_ Vx^~T^^dx + /' V:ii~+T'dx 

(by 11.26) 

= /*=-! — ^x^dz + J Vi +x^dz 

=^— \ z dz 

since the values — 1,0, i of .r correspond to the values 1,0, i of ir = .v-’ 



312 BIOMATHEMATICS 

It is now natural to make the further transformation i -}- ar = w. Here 


dz/dzo = I, Vi + ar = v'®'* the integral becomes 


~ h dtv ^ ■\/w dw 


= - i . I . + i . t . 

= — ~ V8] + h [ V8 — x] 

= § [ V8 - I]. 


PROBLEMS 

(1) Find r^xe-^^^dx 

J *i 

(2) Find J xe-^^^ dx 

(3) Find ^x y/i + jc® dx. Compare the result with example 
(4) above. 

(4) Find .V cos x^ dx. 



CHAPTER TWELVE 


ACCELERATION: GREATEST AND 

LEAST VALUES 


12,1 Acceleration 

Imagine a point P moving up and down a fixed straight line. The 
distance OP of P from a fixed point O will be called y. This distance 
^ we shall suppose measured with the appropriate sign, that is, positive 
if a: is on one side of O, and negative on the other side. We shall call 
the time t, so that ^ will be a function of / and can be plotted graphically 

distance y 
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against t. If the law of motion y = — t we obtain the graph 

shown in the top portion of Fig. 12.1. This shows that the point P 
moves forward, slows down, stops, goes backward through O again 
(at time t — o), again stops, reverses and once again passes through O 
in the forward direction and gradually gains speed. 

The velocity v of the moving point can be calculated by differentia¬ 
tion. V = Dty =3/2 — 1 since y = P — t. At each time t there will 
be a velocity v given by this formula, and we can plot a graph of v 
against i (Fig. 12. i, lower diagram). In interpreting this graph remember 
that a positive value of v represents a for%vard velocity, and corres¬ 
ponds to an upward slope of the graph of y against t. A negative 
velocity v means a backward motion, and corresponds to a downward 
slope of the y graph. With this in mind we see from the v graph that 
the velocity decreases continuously, through zero, to a negative value 
of — I when / = o, and then gradually increases again, becoming 
positive after a short time. 

We can now speak of the “acceleration” or rate of change of the 
velocity zk This will be D{v \ since = 3/2 — i in our case, the accelera¬ 
tion is DtV = 6/. If the distance y is measured in metres, and the 
time t in seconds, then the velocity v will be measured in metres per 
second, and the acceleration D^v in (metres per second) per second, 
or, by a convenient abbreviation, in metres per second^ [m/sec^]. 

EXAMPLES 

(1) If the law of motion is y = t- find the acceleration. 

By differentiation v = D^y = 2/, DtV = 2. Thus the acceleration is 
now constant, and equal to 2 in appropriate units. 

(2) If V = cos /, find the acceleration. We have v — D^y = — sin /, 
D^v = — cos t = — y. Thus in this case the acceleration is equal in 
magnitude to the distance gone, but opposite in sign. The further the 
point P moves from O the more rapidly it then accelerates towards O. 
ff’he result is a series of oscillations backwards and forwards through O, 
as shown by the graph _y = cos t (Fig. 5.9). 

(^) Ify = €'find the acceleration, v =. Dty = DtV = e*. Thus 
in this case the distance from O, the velocity and the acceleration are 
all numerically equal, though they will be expressed in different units, 
'i'he further the point is from O the faster it is moving away, and the 
faster it is accelerating. 

PROBLEMS 

Find the acceleration a corresponding to the following laws of 
motion: (i)y = 2/ 4 - (2) y = ijt, (3)1' = In t, (4)y = 1/(1 + P). 


Since the acceleration DtV is the derivative of v it will be represented 
in the v graph by the slope—upwards for a positive acceleration and 
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downwards for a negative acceleration or retardation. In terms of the 
original graph of y the velocity v corresponds to the slope, and there¬ 
fore the acceleration corresponds to the rate of change of slope. If 
the acceleration is positive the slope will become more and more 
upwards—or in other words the graph is curving upwards. On the 
other hand if the acceleration is negative the graph curves downwards. 
This can be seen in Fig. 12.i. The acceleration is 6 t and is therefore 
negative if t is negative and positive if / is positive. So they graph is con¬ 
cave downwards up to the time / = o, and thereafter concave upwards. 

Now the velocity v is the derivative of the distance y. The accelera¬ 
tion is therefore the derivative of the derivative of y, or as we shall 
say, the “second derivative” of y. Since v = D,y, we can write the 
acceleration of DfV as Z)j(Z),y); but this is usually contracted in writing 
to Z)< 2 y, just as the product yy is contracted to y“. Each commonly 
recognized way of writing the first derivative Z),y will give rise to a 
notation for the second derivative. Thus if we write v as Z)y, we shall 
write the acceleration a as D^y\ if v is written y', a will be written as 
y'\ and if v is written y„ a will be written y,, [not asy,-*, as that would 

mean Z)('y, i.e. dyjd (Z^)]. Finally if v is written as ^ or 

acceleration will be y* symbol being used as an alterna- 

tive to Dt. This last symbol is often written and this may be 

dt^ ^ 

considered as the standard way of writing the second derivative 
(“second derivate”, or “second differential coefficient” are alterna¬ 
tive names). Unfortunately there seems nothing to be said in favour 

d^y 

of the notation except that it is commonly accepted; it does not 

look very natural, and if taken at its face value can lead to errors.* 
We shall not use it in this book. 


• As an example consider a point moving according to the law y - - f*. Wc have 

already shown that = Dty « zt^ = 2. Also — DyV 1 and - ss 

7/ - , \<iy/ 

.. d^y d^y dy* , d*y , 

« o. Now j/a = therefore 2 « o, which is absurd. 

The fallacy lies in the suggestion made by the notation that we can cancel out 
d^y dy"^ d^y 

the dy^ in to get When we tr>' to do such a cancellation we get an absurd 

answer. But it we write it properly as £)>»>■ (D/y)* there is no lonKcr any temptation to 
contract if fallaciously to Df*y. But this condemnation docs not apply to the ordinary 




way of writing the first derivative as This can be quite helpful. It suKRcsts the 


correct formula = 


dv: dtv dy 


dt ^ dv dt change of variable, and also reminds us that 7' is 
d ^ St 

cry nearly c<iual to ^ when Sy and Sf are smalL 
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From the acceleration we can go on to further derivatives. The 
rate of change of acceleration D^a will be the “third derivative” Dfyj 
and this when diflferentiated again will give us the fourth derivative 
and so on. These can be easily calculated when we have a 
mathematical expression giving y in terms of t. But although it is easy 
to visualize the meaning of the first two derivatives, the velocity, or 
rate of change of position, and acceleration, or rate of change of velocity, 
it is much more difficult to form any clear mental picture of the rate of 
change of acceleration; while for still higher derivatives it is practically 
hopeless. They are quantities exactly defined in mathematical terms 
and useful in calculation but rather difficult to grasp imaginatively 
except in the vaguest manner. 

FURTHER EXAMPLES 

(4) If _y — find the successive derivatives of y. 

The velocity v = D^y = 4^^. The acceleration is the rate of change 
of velocity — Dcy = 4 X 3/^ = i2i®. The rate of change of accelera¬ 
tion = D^y = 4 X 3 X 2 ^ = 24^. The fourth derivative D^^y = 
4X3X2X 1 = 24. Since this is a constant the fifth and all sub¬ 
sequent derivatives will be zero, 

(5) y = find the successive derivatives of^'. 

Since — e* = v, we have Dt^ = e*, and all succeeding deriva¬ 
tives are equal to 

(6) If V = find the successive derivatives oiy. 

We have />,y = Ke^\ D^y = K , D,^y = 

= and in general 

(7) If V = cos t find the successive derivatives of jy. 

Dty = — sin t, Dfy = — cos /, Dt^ — sin = cos and 

the process then repeats itself, = — sin f, etc. In general Dt*^y = 
cos /, -= — sin t, '= — cos t, = sin t. 

PROBLEMS 

(5) y — cosh t find the successive derivatives of y. 

( 6 ) \f y = cos Kt find the successive derivatives of;y, 

(7) If y^ A ^ Bt 4 0= + find A"y. 

( 8 ) If jy = In (i 4- /) find Dyy. 

12.2 Maxima and minima 

We know that if a hea\w body, such as a cricket ball, is thrown 
into the air, it will rise until it reaches a certain maximum height, and 

then fall again until it hits the ground, when its height will be a mini¬ 

mum. A piece of down, on the other hand, will he blown about by 
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the wind, rising and falling for quite a time until it comes to rest. In 
general any function y oi t considered for a certain range of values 
of f (say from t = t^xo t = will have one or more maximum and 
minimum values in this range. 

We have already introduced a scheme of classification for such 
maxima and minima. A value of y which is greater than all other 
values in its immediate neighbourhood is a local maximum. If it is 
greater than all other values without restriction, or at any rate not 
less than any other, it is an absolute maximum. Generally speaking 
therefore an absolute maximum is necessarily a local maximum, but 
the converse need not be true. (We could in theory have a value which 
would be an absolute maximum and equal to all the neighbouring values, 
so that it would strictly speaking not be a local ma,\imum. But such 
a case is unlikely to occur in practice, and not difficult to deal with if 
it should occur.) 

We shall concentrate our attention on the local maxima and minima, 
firstly because they are of interest on their own account, and secondly 
because the simplest method of finding the absolute maximum is to find 
all the local maxima and test their values individually to see which 
is the greatest. Accordingly we shall usually omit the qualifying word 
“local”, speaking simply of “maxima” and “minima”. 

A further classification can be made. A maximum or minimum 
value which occurs at the beginning or end of the curv'e will be called 
“terminal”, one occurring within the range of values will be “inter¬ 
mediate”. Thus in the graph of y shown in Fig. 12.2, y has a terminal 


0 



absolute minimum at P, an intermediate absolute maximum at O, 
another intermediate maximum at T, intermediate minima at R and Zf, 
and a terminal maximum at V. ' 

The terminal maxima and minima are as a rule ea.sy to deal with. 
If the function is increasing at the .starting-point of the curve, P, as 
in Fig. 12.2, then P will be a minimum; if the function is decreasing 
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then P will be a maximum. Thus we have the simple conditions for 
the initial point of the curve, t = /j, that Dty > o corresponds to a 
terminal minimum, and Dty < o to a maximum. At the final point, 
t tzi the conditions are reversed, Dty > o corresponding to a maxi¬ 
mum and Dty < o to a minimum. 

Now consider the intermediate maxima and minima. Near a 
maximum point, such as we see that the curve must be rising on 
the left-hand side of Q, and falling on the right-hand side. This can 
happen as at T in the figure by a sudden change in direction. But 
more often there will be a smooth maximum as at Q. Now at such a 
point Dty cannot be positive, i.e. the tangent cannot be sloping upwards, 
for then there would be points of the curve on the right of Q where y 
is greater that at Q, contrary' to our supposition that it is a local maxi¬ 
mum. Similarly the slope Dty cannot be negative at Q. Thus the only 
possibility remaining is that Df}' = o, and the tangent at Q is horizontal. 
For similar reasons the tangent to the curve at a minimum point P 
must be horizontal: for if it sloped upwards there would be points 
just on the left of P where there would be still smaller values of y, 
while if it sloped downwards there would be such points on the right of 
P. 'rhus we have the following general test for an intermediate maxi¬ 
mum or minimum that (subject to rare exceptions like T already noted) 
the slope Dty will be zero at such a point. Conversely, generally 
speaking a point where Dty = o will be a maximum or minimum. 
Occasionally there will be a point such as S in Fig. 12.2 where the 
tangent is horizontal, but which is neither a maximum nor mini¬ 
mum, but merely a momentary halt in the general rise or fall of the 
curve. 

A similar result holds for any function y of a single variable t, 
whether / represents the time or not. The intermediate maxima and 
minima will occur at values of t for which Dty — o. 

EXAMPLES 

(i) A ball is projected upwards with velocity 10 metres/sec, so that 
its height _v alter / seconds is given by y = lo^ — 4*9^“- When does 
it reach its maximum height, and what is that maximum? 

'rhe ball clearly touches ground wheny = o, i.e. t ~ o or i — 10/4*9 
= 2 04 seconds. These are the positions of minimum height: between 
them there must be at least one maximum. Now Dty = 10 — 9*8/; 
so that we have a stationary’ point when DtX = o, i.e, t — 10/9-8 = 
1-02 seconds. Since this is the only stationary point, and since there 
are no sudden changes in the slope /),y, it follows that this must be 
the intermediate maximum point. Substituting in the equation 
y = 10/ — 4 9^" ^ve find the maximum height to be 5*1 metres. 

This problem could also be solved by the method of “completing 
the square” of Section 3.4, since y is a quadratic function of t. The 
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method of differentiation is, however, more general, as we shall see 
from the next example. 

(2) A farmer wishes to construct a rectangular sheep-fold of area 
25 square metres. What length must he make the sides so as to make 
the length of the boundary fence a minimum.? 

Suppose that the sides of the rectangle are x and y metres respect¬ 
ively, then xy = 25, so that y = 25/.V. Now the length of the fence 
must be L — 2x 2y metres. In order to find when this is a maximum 
or minimum we must first express it in terms of a single independent 
variable, say x. Using the relation jy = 2^lx we have in fact L = 2x -\- 
$o/x. We now must solve the equation D^L = o [L takes the place of 
y and X the place of t in the formula D,y = o], i.e. 2 — so/.v^ = o, 
i.e. i/x^ = 1/25. So = 5 metres, and therefore y — 5 metres and 
the enclosure is square. It remains to see whether this is a maximum 
or minimum. Now' if we make x very small, y = 25/A: becomes very 
large, and the length L = 2x 2y becomes very large. If a* is large 
it also follows that L = 2x 2y is large. Thus somewhere betw'een 
these tw'o extremes there must be at least one minimum; and since 
we have found only one stationary point, that must be the minimum 
in question, i.e. of all rectangles w'ith given area a square has the 
smallest perimeter. 


In these examples we have deduced from the genera! form of the 
curve that the point at which the derivative vanishes is a maximum in 
the first example, and a minimum in the second. But there is also a 
simple general method of distinguishing between intermediate maxima 
and minima. Suppose that y has a stationary point when / = T, so 
that then = o. Calculate the value of the second derivative D(-y 

at this point. If this is positive, it means that the slope D,y of the curve 
**^creasing around that point. Since it is zero when t = T, \t follows 
that it must be negative for values of / slightly smaller than T, and 
positive for / slightly greater than T. In other w'ords the curve is 
sloping downwards as far as the stationary point t = T, and thereafter 
slopes upwards, and the stationary point must be a minimum. On 
the other hand if D,y is negative when t Tthe point will be a maxi¬ 
mum. 'I’his can also be shown geometrically. A positive value of 
means that the curve of y is concave upwards, as at the minimum 
point /i in Fig. 12.2, whereas a negative value of Dry means concavity 
downwards, as at the maximum point Q. If Df^y o we can have 
either a maximum or a minimum or a “point of inflexion” such as 
o in Fig. 12.2; such a case will be discussed later. 

Thus in example (1) above, y = tot ~ 4-9/^ = _ g.g < o, 

that any stationary point for jy must be a maximum. In example (2),’ 

^ 7- 2x + 50/.V, /V/- = ioo/.v“ and is positive, so that the stationary 
point IS a minimum. 
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FURTHER EXAMPLES 


(3) Robertson (Child Physiology, p. 249) has found on theoretical 
grounds that the gro%vth in the weight of an infant up to nine months 
is an autocatalytic phenomenon, the relation between the weight to 
(ounces) and age t (months) being 

In za — In (341-5 — to) = K(t — i-66). 

At what age will the infant grow most rapidly? 

The rate of gro\\Th v ~ Dtzo. This can be found by differentiating 
Robertson’s equation, obtaining 

v/w + ®/(34i-5 — w) = K 

i.e. on reduction to a common denominator, 

34 i- 5 zj/[hj( 34 i -5 — w )'\ = K 

V = A:w(34i- 5 — w)/34i-5. 

Now the maximum rate of growth will occur when — o. But 

D^v -= . DiW = A:(34i- 5 — 2w)7;/34i-5 

This will be zero when 341-5 — 2w = o, i.e. when to = 170-75 ounces. 
It remains to test whether this is a maximum or a minimum point. 
On differentiating this expression with respect to t by the product 
rule we have 


Dth' = [A734i-5]{Z),(341-5 — 2w) . v + (341-5 — 2w)Dtv} 

But 7 ^,(341-5 — 2w) ~ ~2 D^w — — 2v, while at the stationary point 
DfV o by definition. Therefore D{^v = [A734I-5] [— 2t'"], and is neg¬ 
ative at this point (since v- is necessarily positive). Thus, in fact, we 
have the maximum rate of gro\v-th. 

'The age t at which this maximum occurs is found by substituting 
the value to - - 170-75 in the original equation connecting to and /. 
We find 

In 170-5 — In 170-5 = K(t — 1-66) 
whence t i-66 months = about 7 weeks. 


(4) A cylinder has to have a given volume V. How must the 
length L and radius R he chosen so as to make the surface area A a 
minimum ? 


The formulas for the area and volume of a cylinder are 

A — 2 ttR(R -f" 7 ,) 

V 7 tR^-L 


Since V is given there is a relation between L and R, and the first 
step in finding the minimum value of A is to express it as a function 
of a single variable, say of R. From the second relation we have 
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L = Vj-nR^, and so ^ = 2.itR{R + Vj-n-R-) = zttR^ + zVjR. (Here V 
is a known constant, by hypothesis.) A maximum or minimum value 
of A will occur when DrA ~ o, that is 

^7tR — zVjR^ = o 
i.e. R^ = V/ZTT 

R = [V/zTryi^ 

whence L = Vj-irR- = 2 [K/ 27 r]'/^ = zR. Thus the length of the cylinder 
is equal to its diameter. 

To test whether this is a maximum or minimum we must find the 
sign of Dr A = 47r + ^VjR^ at the stationary point. Since V and R 
are positive, Dr^A must also be positive, and therefore this gives us 
the minimum area. 

(5) John Hunter wrote as follows; “To keep up a circulation 
sufficient for the part and no more, Nature has varied the angles of 
origin of the arteries accordingly.” Let us put the following inter¬ 
pretation on this statement. Let AB be a main trunk, of radius R, 
and C a point to be fed by a branch artery PC of radius r leaving AB 
at P (Fig. 12.3). We suppose that when R, r and the point C are given, 



Fig. X2.3—Flow of blood in a branched artery 

the point of origin P of the branch artery is to be chosen in such a way 
that the fall in pressure between A and C is a minimum. 

Let p be this fall in pressure, supposed mainly due to friction against 
the arterial walls. Then by Hess’s law p is proportional to APjR -f 
PCjr = y (say). Now draw CU perpendicularly from C onto AB, and 
let the distances CU = //, AU = u, and the unknown PU = x also 
^UPC = e. 'I'hcn AP^a - .v, PC = y/{x- f- h% so that 

y = {a ~ x)IR + (.v 2 + //-)t/r 
D^y = — !//;+ x{x~ + h~)-*/r 

This is zero when r/R = H h~) = PUjPC = cos 0 . 

Also = D^D^y) 

{x- + /i^)-^/r - + h^yi/r 

(by the product rule) 

= (x^ d- h-^)-yr . [(x'^ + h^) _ x^] 

= h%x^ + h-yijr > o 

80 that this value of 0 gives a minimum. 

V 
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We see from this that the angle 8 depends only on the ratio of the 
radii of the main and branch arteries. If this theory is correct, then 
when r is very small compared with K the branch should come off 
almost at right angles, while if r is nearly equal to i?, cos 8 is nearly i, 
and 8 nearly o. Thus a branch of large calibre can be expected to 
come off nearly parallel to the main trunk (e.g. the external and internal 
carotids). 

( 6 ) In a submarine cable with an insulating sheath the rate of 
signalling is proportional to — x- In x, where x is the ratio of the radius 
of the core to the outer radius of the sheath. What value of x gives 
the most rapid transmission ? Assuming that a nerve acts in the same 
manner, does this agree well with the observed ratio i : i *6 of the radii 
of the axon and myelin sheath? (W. M. Feldman, Proc. Physiol. Soc.y 

1923) 

If — Kx^ In X is the velocity of the impulse, then 

Dx'o = — ^(2.v) In X — Kx-jx 
= — Kx{2 In X 4 - i) 

Dx-v = — K (2 In X 4 - i) — iCx( 2 /x) 

= — K{z In X 4- 3 ) 

A maximum or minimum point will occur when D^v = o, i.e. when 
2 In X 4 - I = o, In X = — I, or X = e-^ = i/i'bS- This agrees well 
with the observed ratio i/i- 6 . The value of is then — K{ — 14-3) 
= ~ zK < o, so that this is a maximum. 

(7) When ethyl acetate is hydrolysed in the presence of acetic acid 
as a catalyst the following reaction occurs: 

CH3COOC2H5 4- H3O -^CHgCOOH 4 - CgH^OH 

The acetic acid formed thus gradually increases in amount and 
tends to cause an acceleration in the velocity of the reaction. But at 
the same time the active mass of the ester diminishes, thus causing a 
retardation in the reaction velocity. At what point will the velocity be 
a maximum? 

Let the initial concentration of acetic acid be a moles/litre, and of 
ester b moles/litre. Suppose further that x moles are hydrolysed after 
a time producing x moles of acetic acid. Then there will be (a 4 - x) 
moles of acid and {b — .r) moles of ester remaining, and the reaction 
velocity is proportional to = (a 4- — x). A maximum or mini¬ 
mum will occur when = o, i.e. (6 — x) — (o 4 - or 

x= \{b — a). 

PROBLEMS 

(i) An open tank is constructed with a square base and vertical 
sides so as to contain a given volume V of water. WTiat must be the 
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relation between depth and w-idth so as to make the expense of lining 
it with lead a minimum? 

(2) If ^ ~ for what values of t will y be a 

maximum or a minimum ? 

(3) For what positive value of / is In t a maximum ? Sketch a 
graph of the function. 

(4) What is the minimum value of /*/' for positive and what is 
its maximum value? Sketch the graph. 

(5) What number exceeds its square by the greatest number 
possible, and by how much? 

(6) Two light bulbs of 32- and 4-candIe power respectively are 
placed 3 metres apart on a table. If the intensity of illumination due 
to a source of light is inversely proportional to the square of the distance, 
what point on the line joining the lamps is least illuminated? 

(7) A window is in the shape of a rectangle surmounted by a semi¬ 
circle. If the perimeter is 6 metres, find the dimensions so that the 
greatest possible amount of light may be admitted. 

(8) A photographic lens of 25 cm focal length forms the image 
of an object. What is the minimum possible distance between object 
and image ? 

(9) A piece of wire 2 metres long is cut into two parts. One part 
IS bent to form a square, the other a circle. What are the lengths of 
the parts when the sum of the areas formed is a minimum ? 

Note. —It has been suggested that bees build honeycombs in such 
a way that they use the minimum amount of wax to contain the maxi¬ 
mum honey. 'I’his leads to a problem in calculus (see d’Arcy W 
Thompson, Growth and Form) and the solution does correspond 
reasonably well with the measured shape of the honeycomb. But 
probably this is little more than a coincidence, since it is doubtful 
whether this does save any appreciable amount of wax. 'I'here may 
be other factors determining the form of the cell. 


12.3 Improved formulas for small changes 

If j is a differentiable function of t then the ratio 

Sylit = (jv, -y,)l(l^ - t,) 

tends to the limit D,y when — tends to o. We have already 

sugg^ted that when ht is sufficiently small the difference between Sy/Sr 
and p,y may be negligible for practical purposes, and it may be quite 
good enough to say that hyjht = D,y, or 8jy = (n,y)ii. This eives an 
approximate formula for the calculation of &y. But it is not in^general 
absolutely correct, and we shall now investigate how we can improve 
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The equation hyjht = Diy will be exact if the velocity v = D^y is 
constant: for then the average velocity, as well as the instantaneous 
velocity, will always be equal to v. For the graph of y plotted against t 
will be a straight line of slope v, and the line joining any two points 
(^1. (^2» 3^2) on the graph has a slope equal to Sy/St. But this line 

must coincide with the original graph, so that Syjht = v exactly when 
V is constant. 

This suggests that the main cause for inaccuracy in the formula 
8 ^ = (Diy) St will be curvature of the graph. This curvature corres¬ 
ponds to a change in the slope of the graph, i.e. to a change in the 
velocity or rate of change D^y. This change in the velocity corresponds 
to the existence of an acceleration D(V = Df^ (or a second derivative, 
if t represents some variable other than the time). 

Now when the velocity is changing the simple formula Sy = (Dty)8i 
is ambiguous, since it does not state explicitly at what time t we are to 
calculate the velocity (or rate of change) The simplest choice is 

to take the time t to be We shall denote this by a suffix i, writing 
Vi — (/ 3 ,v)i for the value when t = The first approximation to the 
value of Sy — jg — 3 'i is therefore 

— • • • (12.i) 

and this formula is exactly true on the assumption that the velocity is 
constant and always equal to(Z)ty)i. It is sufficiently nearly true if the 
velocitv does not change appreciably between times and 

'I'his suggests that for our next approximation we should take the 
acceleration a = D^y to be sensibly constant, as follows. 

12.4 Motion with constant acceleration 

'fhe equation D^y ~ \\i = a involving the second derivative is 
called an “ordinary differential equation of the second order”. In 
general any equation connecting /, v, v = Diy = Vt and a = D^y — yu 
is a second-order equation. Such equations are of many types and 
will not lie discussed in detail here. But the special type y^ ~ a is 
easily solved, whether a is a constant or not. We can get from the 
acceleration a to the velocitv r by integrating with respect to /, since 
a is the derivative of r; and we can then get from v to y by a further 
integration. 

= Cl + J a dt 

where C, is a constant of integration; 

y = Co + J v dt = Cn -\- Cd q- f(J<^ dt)dt . (12.2) 

where C2 is a second constant of integration. Thus there are two 
integration constants in the solution of a second-order equation. This 
is a general property, true for all such equations. S.uch constants can 
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be determined if we are given, in addition to the original equation, 
the position y at two different times, or the position and velocity at 
one time, or any two similar items of information. 

If a is constant, then J a dt = at, J{j a dt)dt = J at dt ^at~, 

so that the equation (12.2) becomes 

_y = C2 + Cjt + iat" . . . (12.3) 

We can interpret the numbers and C2 as follows. When t = o, 
y'=^2> i-e. Cg is the initial position and y — is the distance 
travelled. Also on differentiating (12.3) we obtain v = yt = C-y + at, 
so that when f = o, zj = Cj = the initial velocity. Thus (12.3) can be 
written 

y ~ Ci = distance gone = (initial velocity)/ + \at' 

(a formula to be found in many text-books on mechanics). 


examples 

(i) Suppose the acceleration a = e*, what is the position y} 
By (12.2), since J a dt e\ J{J a dt)dt = e\ 


y — C2 + c^t + e* 

(2) Suppose the acceleration a = 1//2, what is the position y? 

J a dt — — ijt, f{f a dt)dt = — In /, 


so that 


y = C2 + C,/ — In /. 


p R o B r. E M s 

(1) A body is moving with acceleration a — sin t. Supposing that 
It starts at time / = o from position y = o with velocity v = i, what 
IS its position at time /? 

(2) A body has acceleration (i + /); find a formula for its position. 


Now from our point of view it is the time / = / j rather than the 
time / = o which is our starting-point. So it is convenient to chanee 
th^ variable from / to T - / - /,. For when / - t„ T = o, and when 


Now if Q is any function whatever of /, we know that Qr = O.t-,. 
(Section 8.11). But / = /, -f- T, so that /y = 1, and Q-j< = Q., That 
to say, differentiation with respect to T is the same as differentiation 
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with respect to t. It follows that, whatever Q may be, Qj-t — {Qt)t = 
{Q^)t = Qti, and similarly Qttt ~ Qttt> and so on for all higher 
derivatives. In particular, ify represents the position, theny^ = yT~ ^ 
is the velocity, and ytt — yrr = « is the acceleration. Now suppose 
that at time t = ii the body has position yj, velocity = (y<)i, and 
acceleration ai = yn, which is a constant. Then we can write 

yrr = 

Integrating this with respect to T, we have 

^ — Jr — f = Cj + OiT. 

But when T — o the body has velocity and therefore from the 
above equation i.e. 

V ^ yj- = -f a^T. 

Now integrate this again with respect to T: 

y = f v dT — C2 + ViT + 


But substituting T = o we obtain yi = Cg, i.e. finally 

y = Vi + ViT + iaiT‘ 

= Ji +[j/]i(^ - ^) + - ^)- 


(12.4) 


'I'his is therefore the general formula giving the position y of a body 
moving with constant acceleration Oj, when its position yi and velocity 
t’l at time are known. At least, we have found it convenient to talk 
of “positions” _r. “velocities” v and “accelerations” a, but the same 
formula will hold for any function y of any variable t, when the second 
derivative r,, is constant and equal to 


12.5 Second approximation to a small change 

Formula (12.4) is exact when the second derivative y<t = rti is 
constant. It will be a good approximation if the time interval is so 
short that the second derivative can be regarded as substantially con¬ 
stant. It can also be written 

V —y, = 

If we put t 1 then T — = S/, Vo “ Vi “ ^y> so that this 

becomes 

Sv---Vo — Vi = + Irt,(S/)2 . . (12.5) 

This is the second approximation to 8y, and differs from the first 
approximation only in the inclusion of an extra term, namely 
— ^D’o]i (^0^ representing the effect of the acceleration. 
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EXAMPLES 

(1) A cubical box of side i cm expands under heat to a side i i cm. 
What is its change in volume ? 

The relation between volume V and side ^ is V = x^, whence 

= 6x. Therefore 

where the suffix i indicates that we take the value when x — i, i.e. 
(^*)i = 3 i (f^xx)i = b. But Sx = -oi, whence 

SF 3 X -oi + ^ X 6 X ( 01)2 
= -0303 

to the second approximation. The first approximation would be 

SV (F*), Sx = 3 X OI = 03. 

Actually here we can find the true value for SF, for the final volume is 
i-oi®= 1*030301 and the initial volume = i, whence SF=s 
1*030301 — Is *030301 exactly. Thus the second approximation will 
suffice for most purposes whereas the first has about 1% error. 

(2) Find In i*i. 

Let y = \n t, = I, T = t ~ 1. Then by (12.4) 

3' = In (i + T) + [yt\\T + h\yti\\T^- 

But 3', s i//, ytt = — i//®, so that putting t = i 

= 0> LVfJi = L [yu]i= — iy and 
in(i + r- 

If we put T = I we obtain 

In i*i 1:5: I — K'l)* = '095. 

Actually In i i = 09531, so that this second approximation is quite 
reasonable. 


PROBLEMS 

(1) Find the first and second approximations to e® *, taking y = e\ 
ti — o. 

(2) Find the first and second approximations to 

The equation (12.4) can also be applied to the problem of maxima 
and minima. Let /, be a value of t for which y is stationary, i.e. yt = 
Vi = o when t z= tx. Then any neighbouring value of y is approxi¬ 
mately y, 4- where a, is the value of the second derivative y^ 
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when t = ti. Now is always positive: so if Oi is positive we must 
have 

> 3^1 

i.e. yi is less than every neighbouring value ofy, and so is by definition 
a local minimum. On the other hand if is negative then y < ji, 
and yi is a local maximum. This justifies the rule for distinguishing 
maxima and minima given in Section 2.2. 

12.6 Orders of small quantities 

If St, say, is small, then (Sf)® is much smaller, and (Sf)® much 
smaller still. For example, if 8/ = -ooi, then (St)" = 'OOGOOI and 
__ -ooooooooi. 

It is often convenient to classify the various small changes we have 
to deal with into “ orders of smallness”. We can select one of them, 
say St, and call it a “small quantity of the first order”. Any quantity 
which is of about the same size as St, such as 2 St, or -rr-St, or St, will 
also be considered as of the first order. Any quantity which is com¬ 
parable in size with (S/)®, such as 3 ' 5 ( 8 t)", is considered as the second 
order; one comparable with (St)® is of the third order, and so on. 
Naturally this is rather a rough division: there is no exact line separating 
the first order from the second, or the second order from the third, 
but all the same it is very useful. For we may have a good idea of 
what accuracy we require in our calculations—possibly small quantities 
of the second order can be neglected for our purposes, possibly only 
third-order quantities. But if we know that, say, second-order quantities 
are negligible, then they can be omitted from our calculations without 
appreciable loss, and often with considerable saving of effort. Thus 
we know that [i Sr]^ — i + 38/ 3(8^)® -H (S/)® exactly: but if we 

arc allowed to neglect (S/)® and (Si)® this simplifies to [i -h Si]® =: 
I •- 3Si. If we can neglect (Si)® but not (Si)® we shall have [i Si]® = 

I -1- 3Si 3(Si)- sufficiently accurately. As another example we have 

[i ; 2 Si] [2 Si 3(5i)®] -= 2 Si + 7(Si)2 + 6(Si)® exactly. 

But it (Sty is negligible this simplifies to zSt -f- 7(Si)®, and if (St)- is 
negligible it is enough to write 2Si for the product. 

'I’here are certain rules which 
quantities; 

(A) The sum of two quantities of the first order is also of the first 
order; and in general the sum of two «th-order quantities is of the 
«th order. For if .v and y are both about the same size as (Si)’* so also 
is (.V H- y). 

(B) The sum of a first-order and a second-order small quantity is 
of the first order, since the second quantity is small compared with the 
first. The same applies to the sum of an wth-order and an «th-order 
quantity; if n r - m, the sum is of the ;//th (i.e. smaller) order. 


help us in manipulating small 
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(C) The product of two first-order quantities is of the second 
order: for x 8t = ( 5 /)^. Similarly the product of an mth-order and 
an «th-order quantity is of order {m -f ti). 

(D) If S/ is of the first order and y is any function of t then the 

corresponding change 8y iny is also of the first order. For 8y . 6 /. 

But at points where the derivative D,y is zero or infinite, and near such 
points, this will no longer necessarily be true: 8^ may then be of a 
higher or lower order than St. 

*2.7 The general approximation 

The first-order approximation to _y, jy = -j- [jv,], 7 ' = yy -|- 

— ^1) will be satisfactory if T‘ is so small that it can be neglected. 
If it cannot we must use (12.4), 

which takes the second derivative into consideration. We can obtain 
the wth-order approximation, counting all powers of T up to T", by 
taking all the first 7i derivatives of y into consideration. 

Let us illustrate by considering the fourth-order approximation, 
taking into account the derivatives y„„. If we write 

^ ^ “7 we know that yt = yr. ytt = >’rr. = jrrr. yttu = 

yxTTTt since differentiation with respect to T is equivalent to differen¬ 
tiation with respect to /. Let[y,],, etc., denote the values of these 

derivatives when t = ty, i.e. when T = o. 

Now let us assume that we can take the fourth derivative yxTTT to 
be constant to a sufficient order of accuracy. Then we can denote its 
value by i.e. 

yXTTT — [yittt]i- 

On integrating this with respect to T wc get 

y’TTT = C, + [ytitt\iT. 

But when T = o, Cy = [yrTrli = so that 

y-j-TT ~ [yin]i + LVttffJiT. 

An integration of this with respect to T gives us 

yxT = -^[yitt\iT + 

Putting T = o we see that Cg is the value of y-p-j- = y^, when t = t, 
I.e. C2 = [ytt]u and 

yTT~[yti\i + [ytit]iT + {\ytttt\\T^- 

Again on integration 

yT = C3 + —- lytttt^iT^ 

2 • 3 

= [y,]. +[>.-]. 7 ’ + 

^ ■ 3 
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since C3 ^ [yt]i follows on putting T=o. FinaUy by a last integration 

y=y. +I>.]i7’ + 

approximately. This gives the appropriate formula for y when T® is 
negligible. 

The general pattern is now clear. To get the nth order approxima¬ 
tion we continue the series 


yx+-[y;\iT+^htt]iT^+ 


1.2 


1.2.3 




1.2.3.4 




as far as the term in T”. 

It is useful to have a symbol for the product i . 2.3.4 . . . . (n — i)» 
of the first n integers. This is called “factorial 71“ and is written |n or 
72!. Thus 


|£ = 2! === I . 2 = 2. 

13 ^ 3! == I .2.3 = 6. 

|_4 = 4! I . 2.3.4 = 24. 
j_5 = 5!=i.2.3.4.5 = 120. 

|6—6! = I.2.3.4.5.6 = 720. 

12 = 7! = !.2.3.4.5.6.7 = 5040. 

|8“8! = I.2.3.4.5,6.7.8 = 40320. 

[9 = 9! = I.2.3.4.5.6.7.8.9:= 362880. 

110 = 10! = I . 2.3.4.5.6.7.8.9 . 10 = 3628800. 

Notice that [3 == i . 2.3 = 3 (i . 2) = 3 [4 = i . 2.3.4 = 

4 (i . 2 . 3) = 4 |3, and so on, and in general 

j 72 = 71 |« — I . . . . (12.6) 

This relation enables us to attach a value to [O. For if we suppose 

|0 to be defined in such a way that (12.6) remains true when 72 = i, 

we have ji = i ]o. But 11 = i, and therefore we must set ]o = i. 

This is a little unexpected, but we shall find that it fits in with a great 
number of formulas, where otherwise a special exception would have 
to be made. 
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We can therefore write in general 

y^yi + ,7 [yt]iT-h^ (>a]i7^+ . . . + ~[DryhT- 

I— l_ 1 ^ 

+ if • • (>2.7) 

a-, 11 

« I 

(or 2 — [Dt y\i{t — if we interpret Dt^y to mean y). 
a=0 1^ 


examples 

(i) Find an approximate formula for i/t near ti ®= i. 

Put T = t — ti = t — I, then i = i + T. Also writejy = i/t, so that 
3',= = (-i)(-2)/-3 = = 

~ — in general Dty = ( —i)*' \r So when t = ti 

= I. 3 'i = I. [yt]i = — I, [yii]i “ [yddi = — |^ and in general 
= ( —i)’’ Substituting these values in (12.7), 





"T 


\ 




T — 7^ . -I- (— 


| 3 _ 


For example, the successive approximations to i/i-i are {T = -i), 

^ X — ‘I = - 9 , I — I + (-1)2 = - 91 , I — -I + (-1)2 _ (.x)3 = .gog 

* — (‘ 0 ^ — ( x)® + ( x)^ = ’9091, and so on. As i/i i = the 

recurring decimal 909090 . . . , we see that successive approximate 
values rapidly approach the true value as a limit. 


/ = /x = 


o. 


(2) Find an approximation to e* near 
Put y = e\ and T t — t^ = t. 

Now by successive differentiation y, = e* — y^, = yf^^ = 
that the values jj, [jy,],, L>'rf]i, • . w'hich are obtained by 
^ = ^1 = 0 are all equal to i. Thus 


« . » SO 

setting 



= I + /V|2 -f /V|3_+ - ■ . + ^"/|^ 

(3) Find an approximation to antilog / when t is small. 

We know that antilog t = where M = log e = 4343 

Iherefore we have merely to substitute t/M for t in the result of the 
last example, obtaining 


antilog I |2 + |3 + ■ • • + |« 
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By taking n, the order of the approximation, sufficiently large this can 
be used to calculate antilog t to any desired degree of accuracy. 

PROBLEMS 

Find approximate formulas for the following functions for small 
values of T: 

(1) In (I + T). 

(2) log (i + T). 

(3) siri 

(4) cos T. 

(5) + ^) as far as the term containing T*. Use the approxi¬ 
mation to calculate 

12.8 The general rule for intermediate maxima and minima 

We can apply the formula (12.7) to more difficult cases of maxima 
and minima. If is a value of t which makes {yt]i zero then (12.7) can 
be written 

y^yi -\-[ytt]i(t — tiYI\^ + lyttt]iit — ^)Vi 3 + • • • 

Now if [jVtJn second derivative at is positive, we know that we 
have a minimum, and if is negative, we have a maximum. We 
therefore consider only the remaining case when = o. In this 

case y ^ >’1 + [>’ —/i)®/|3 as far as the third order of small 

quantities. Now (/ — fj)® can be made positive by choosing a value 
of t > tiy and negative if we take t < ti. Thus the term[y<tji(f—^i)V |3 

can be made to have either sign by choosing t to lie on one side or the 
other of provided that is not zero, and so we can find values 

of y greater than and values of _v less than y^. yi is accordingly 
neither a maximum nor a minimum. 

If the third derivative is zero at as well as the first and 

second derivatives, we must go on to consider the fourth-order approxi¬ 
mation 

Now {t — is always positive: so if Lvntji is positive we must have 
y > y^ for any value of near Thus yi is a minimum. If on the 
other hand is negative Vj is a maximum. If is zero we 

go on to the fifth-order approximation, and so on until we come to 
a non-zero derivative. 

The general rule is: 

(i) If the first non-zero derivative when < = is of odd order we 
have neither a maximum nor minimum; 
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(ii) if the first non-zero derivative is of even order and positive 
then is a minimum; 

(iii) if the first non-zero derivative is of even order and negative 
then yj is a maximum. 


PROBLEMS 

(1) Find the stationary points of y = 2 -1- 7/ + 9^2 -|_ -^3 
investigate whether they give maxima or minima. 

(2) Find the stationary points of y = and investigate 

whether they are maxima or minima. 

(3) What is the general rule for a terminal maximum or minimum ? 


12.9 Limits of error in approximation 

It is always helpful when using an approximate formula to know 
how large an error we can expect to find. We can do this fairly easily 
for formula (12.7) in the following way. 

Imagine a hare, a tortoise and a snail having a race. Suppose they 
start off together, but that the hare always runs faster than the tortoise, 
and the tortoise faster than the snail. Then at all subsequent times 
we shall expect the hare to be in the lead, the tortoise second, and the 
snail third. Elementary, my dear Watson ? 'I’hc result is common- 
sense and true, but a formal proof is a little tricky, because of the 
rather complicated nature of the definition of a velocity as a limit. So 
we shall simply take it for granted. 

In this analogy the tortoise corresponds to the function y we arc 
studying. The hare and the snail correspond to specially constructed 
functions between which we can expect y to lie. 'rhere is one obser¬ 
vation on this principle which may be made here for the sake of clarity. 
In saying that the hare is running faster than the tortoise we mean more 
precisely that the difference (velocity of hare minus velocity of tortoise) 
IS positive, where the velocities are taken with their proper signs. Jt is 
not good enough to say that the absolute value of the velocity of the hare 
IS greater than that of the tortoise irrespective of sign. For if the hare 
runs rapidly to and fro, but the tortoise creeps steadily forward, it is 
possible for the tortoise to outrun the hare. 

Consider now any function y of a variable t. Let v = y, be its 
rate of change. Denote by the symbol the greatest value of v in 
the range of values of / we arc considering, and its least value bv v 
so that <v<, for all values of v. 

We now define two functions, the “snail” v„..„ = v. A- v . (t _/ ^ 

and the “hare” Then wh^t 7 = r. ','1} 

three functions take the same value y,. But D,\> „ = v . D v 
‘"wax, Uty = V, so thaty„„„ has always a smaller rate of increase than v 
and y always a smaller rate of increase than It follows by the 
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“ hare, tortoise, and snail” principle that for all later values of /, y lies 
between ymin and y^axl that is 

yi + ^^az{t - h)- 

By a similar argument if f working backwards from 

J'l + - h) >y >yx + - ^)• 

In any case y always lies between the two values yx + “/i) 

and yx + — ^i)- This can be considered as a precise expression 

of the approximate formula, y csy'i + Vx(t — ti). 

This argument can be extended to the wth-order formula to show 
that if is the least value of [/>(">'] over the range of values of t 

we are considering, and [Pt'y^max the greatest value, then y always lies 
between 

(>^1 + b’i]i T + bull TVlf + . . - + 

and {yx + [yJi T -f- [y„], rV|2 + . . . + [Dry-\^ax T"/|h). 

EXAMPLE 

(i) If y = cos /, what is the greatest error that can occur in using 
the formula 8y — sin . ht provided that 18 /| does not exceed -oi ? 

If jy = cos t then y^ = sin t and y^* = — cos /. We know 
therefore that y^ must lie between 

>‘l [ 3 '(]l (^2 — ^l) "b ^Lv'((]mtn (^2 ^l)“ 
y'l “r[y(]i (^2 ^l) "b iL)'u]maa:(^2 ^l)^* 

'I’hat is, Sy = y2 — yx must lie between 

(— sin /i) S? + ( 50 ^ and 

(— sin tx) 8/ + hb'it]max (St)" 

so that the error in taking 8y to be (— sin tj) Bt must lie between 
' - ^ [yuU. i^f)' and - A[y„]„.ax {bty- Buty,, - - cos t, and cannot 
be greater than i or less than —i, so that the error must lie between 
— ^nd ^St)-. Since 18 /| does not exceed -oi, by hypothesis, 

the error of the formula cannot exceed A('Oi)^ = *0005. 

12.10 Repeated partial differentiation 

Let y =/ (/, m) be a function of two variables t and ti. Then we 
can find by differentiation the derivatives -Oti„y = DtV = ft(t, n) = y't 
(with respect to t keeping u constant), and i)ui*v = D^y =/«(^ «) = y'u 
(keeping t constant). In turn these derivatives themselves have deriva¬ 
tives, so that we can find four second derivatives D<(Z)ty) = «) = 

yt(, and D^{Diy) = /,„(/, w) = y(„, the derivatives of y*, and y„, and 
y„u the derivatives of y„. 
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For example if jy = / + then y,= i -zuH, = zu/^, whence 

ytt = Dtyt = 2m2, yt^ = D^yt = 4U/, y^, = Do’u = 4«^ yuu = D^yu = 
2 t^. 

The second derivative D^D,y = is also written as 


( 


-V- 

du)\dt 


> 


or 


d'^y 
du dt 


In the same way, if _y is a function of three variables, say y = 
f{ty «, v) then we can define higher-order derivatives such as 

12.11 Approximation formula for a function of several variables 

We know that if ^ — /(f, u) is a function of two variables, t and u, 
then a small change ht in t and a small change hu in u produce together 
a change 8^ in y given by the approximate formula 

8 ^ +>'„Sh. 

As with the one-variable case more accurate expressions can be found 
by taking the higher derivatives into account. As usual we proceed 
by examining the effect of changing the variables one at a time. 

Suppose we wish to know the value oi y = /(/, «) for values of t 
and u in the neighbourhood of and w,. Let us put t — /, = T" and 
^ = V. Now if as a first step we keep t fixed we can consider y 

as a function of u only, and therefore the approximation formula for 
a single variable will be applicable: 

y=f(t,u)^f(t,u,) + -/„(/,«.) [/ + -/.(/,«,) W + ... (.2,8) 


This formula relates the value of the function at {t, u) to its value 
and the value of its derivatives at (/, «,). The second step is to relate 
these values to those at (fj, m,), i.e. to keep u fixed at w,, and to vary t. 
We can then consider f{t, w,) as a function of the variable t only and 
80 the single variable formula shows that 

f(t, «,)=/('!. «.) + u,)T^ +- 




where the sufhx (,) here indicates that the values are to be taken for 
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t = and « = «i after the differentiation has been completed. In the 
same wayWi), «i), etc., can be considered as functions of t, 

giving 

/u(^ Wl) Ml) + Wi)^ + "1)^^ + • - • 

= LvJi + - [yui]iT + i [yuu]iT^ + . • • 

II !i 

/«u(^ «i) = [>’uji + . 

li 1- 

A substitution of these values into (12.8) arid a slight rearrangement 
of the sum provides the approximation formula 

+ b'Jir+bu]it/ 

E nil II 

|3 Ell 11E 11 

+ etc. ......... (12.9) 

where T = t — U = u — Mj, and the series is to be continued up 
to the point at which further terms become negligible. 

If T' and U are small quantities of the first order, then the second 
row of the right-hand side of (12.9) is made up of first-order terms, 
the third row consists of second-order terms, the fourth row of third- 
order terms, and we can proceed in this way to any desired order of 
approximation, I'he general term can be written in the form 

|^> where h and k are zero or positive integers. 

If k — o, (i.e. y differentiated zero times) is to be interpreted as 

y itself: similarly Dt^DJ^y is to be read as DJ^y when h = o. 

We can, of course, stale this equation more precisely using the hare, 
tortoise, and snail principle. But the result is not so simple as in the 
one variable case, and we shall not give it here. Such a precise formula¬ 
tion is necessary when it is essential to be quite certain that there are 
no loopholes or fallacies in the argument, but it is rather complicated. 
In what follows we shall try to put the argument in a reasonably con¬ 
vincing form, without troubling about the finer details. 
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EXAMPLES 

(1) y = Here all partial derivatives 3',. 3 'u. y’uu ytt^ etc., are 

equal to y = If, therefore, we take — 11^ = 0 we have T = t, 

U =^u, and [y]^ = OJi = (j'Ji etc. -= i, so that 

~ I + f -I- u -I- ^2^|2 + /«/[! jl + ^2^12 + /7(3 + • ■ ■ 

the approximation being carried on until the terms become negligible. 

(2) y = e“. Here all the “mixed” partial derivatives y^^i, 

etc., are zero, yt = y^ = yt,i etc. = and y^ = y^^ = y^^^ etc. = 

so that 

e* + e“ = 2 -f / + « + /V|2 + HVj2 + rV [3 + wV |3 + ■ - • ■ 


For a function of three variables y = /(/, u, v) there is a similar 
formula, 

7 =^ 3^1 +[y;\,T +[y^],U + [y,-\,V + + k[yuu]rU^ 

+ l[y..^^V^~ -\-[y.thTV -\-[y,.„],UV 

+ i -H • • • • • • (12. to) 

where T = t ~ ti, U = u — Ui, V = v — and the sufHx (j) means 
the variables t, u, v must be given the values u^, Vj after the differen¬ 
tiations have been performed. The general term is 

[DfDJDfy],T'‘U'^V^l\h ]l< \L 

PROBLEMS 

(1) Find the approximation to sin {t + u) as far as terms of the 
third order. 

(2) Find the approximation to \/(i t 2tu) as far as terms of 
the second order. 

X2.I2 Change in the order of differentiation 

Consider again equation (12.9). If T and U arc small quantities of 
the first order, and if quantities of the third order of smallness can be 
neglected, then it follows that 

+b,].r +b„,]. 7 ’f/ -i- Uy^ul^u-^ 

This formula was obtained by first keeping / constant, allowing u 
to vary, and afterwards keeping u constant and varying t. By the same 
procedure carried out in reverse order, i.c. first keeping u constant 
and later t, we obtain 

>'^ 3 '. Tb-l.T -\^[y^^U -I- K 3 'o].T^ -\ [y.^J'U -t- 

Z 
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Subtraction of the second equation from the first then gives 

—b’tu]i} TU^o 

provided that third-order small quantities are neglected. Now this can 
only be true if = o, for TU is only of the second order, 

and if it was multiplied by a non-zero coefficient it would not be negli¬ 
gible. So = [ytu]i- But the point (fj, «i) at which we evaluate 

these derivatives is an arbitrarily chosen point, and has no special dis¬ 
tinction. Therefore the equation y^t = ytu must be true in general, i.e. 

DtD^y = D^Dty . . . (12.11) 

In a second-order partial derivative the order of differentiation is 
immaterial. The differentiation with respect to t can be done either 
before or after that with respect to u, and will give the same answer. 
This is illustrated by the example j where = y^t = 4 "^* 

The equation (12.ii) can also be written/e«(w, /) = t) or 

(4)(a-(i)(£> 

in other notations. The same result applies to functions of three or 
more variables: if y is a function of /, w, and t', then j'tu = Jut- For in 
calculating either D^Dty or 0 ( 0 ^}' we have to suppose that v is held 
fixed, so that y can effectively be considered as a function of t and u 
only. For higher orders 

D,-^D„y = D,{D,D,,y) 

- DtiD^Di}') (by 12.11) 

= D,D,{Do') 

= D^Dt{D,y) (by 12.11) 

i.e. r„,f = yiut = so that partial differentiations can be performed 
in any order. 

Note, however, that here the symbol Z),y means the derivative in 
w'hich all the variables w, v . . . other than t are held fixed, and D^y 
means the derivative with /, etc., held fixed, so that written in full 
the relati^)!! would be 


iu. I' . . . I ( . f . , . .V lu.v ... J- 

('I’he relation is no longer necessarily true when partial derivatives are 
taken with other quantities held constant, and then the order of 
differentiation may be important.) 

'I'he result DfD^y = /)„/),y enters into the study of the “exact” 
differential equation (Case J. Section 10.9). The equation Myt 4 - 
A* - o is said to be “exact” if M is the partial derivative of a function 
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/ with respect to y\ and N the partial derivative of / with respect to 
that is, \i M — Dyf, N = Dtf. It follows that 

D,M = D,DJ = DyDJ = DyN. 

This shows that the condition (10.16) DiM = Z)^.V must be satisfied. 


12.13 Maxima and minima of functions of several variables 

Sometimes it is important to know when a function of two variables 
is a maximum or a minimum. For example, a craftsman might be 
ordered to make a rectangular box of given volume V\ and if it is to be 
made of expensive material, he might be further ordered to make it 
of a shape for which the surface area is as small as possible. Now here 
the length L and breadth B can be varied independently, the height H 
being then given by LBH = F, i.e. H = VjLB, and he has the problem 
of minimizing A considered as a function of h and B. 

Exactly as in the one-variable case (Section 12.2) such maxima and 
minima can be classified as “absolute” or “local” and “terminal” or 
“intermediate”. The most important types are the local intermediate 
maxima and minima, and we shall now consider these. 

Suppose therefore that y = /(/, u) is a function which has a maxi¬ 
mum when t — ti and u = m,. This means that "> y for all values 
y in the neighbourhood of y^. It follows that must still be a maxi¬ 
mum even if t alone is varied, n being held fixed: and so from the one- 
variable case we see that the partial derivative D,y -- Byj it — must 
be zero at (/,, Wj), i.e. [yilx = o (except for rare cases where there is 
a sharp corner in the graph). Similarly by keeping t fixed and allowing 
only u to vary we can show that lyj\i = o- I he same results follow 
if>', is a minimum. Thus we have the first result: 


A necessary condition for y = f(t, u) to be a maximum or minimum 
when t = ti and u = u, fr that [y,]i — o and that [yu]i ' - o. 

Possible values of and m, can accordingly be found by solving 
these two simultaneous equations: but further investigation is needed 
to confirm whether the value obtained is in fact a maximum or mini¬ 
mum. 

Now let {t, u) be a pair of values of t and u near and u^, and let 
us put T = t — /j, f/ = u — M,. If (/,, K,) is a stationary point, i.e. if 
[yt\i = o, (jy„]i = o, then from equation (12.9) 

y^-yi Tf/ -i 

as far as the second order of small quantities. 'I’hus if the expression 

is negative for all small values of T, U (excluding the case when both 
T and U are zero) it follows that_y <y^ for all points near jvj, and so 
yi is a maximum. Conversely if the expression above is always positive 
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then yi must be a minimum. If it is sometimes positive and some¬ 
times negative we have neither a maximum nor a minimum. To see 
how this works out in practice let us take a few examples. 

(1) y=t^-^ uK 

Every stationary point must be a solution of the equations [yji — o, 
Lyji = that is, 2ti = o, 2Ui — o. The only stationary point is 
therefore u^) = (o, o). 

Second differentiation gives [ytt]i = 2, [ytuii = o> [yuu]i = 2. Thus 
the approximation near (o, o) is given by 

y —yi H- l[yii\\T- -\-[yt,i\\TV + \[yu^-iU~ 

= o + r- H- o 4- ^ 

since T = t — ti = U = u — Mj = «. Here this “approximation*^ 
turns out to be exact. Now is positive except when f = o, and 
is positive except when ii — o, and therefore {fi + is positive, 
except when t =■ u ~ o. Thus y is always positive except for its value 
o at the stationary point (o, o), which will accordingly be both a local 
and an absolute minimum. 

(2) Consider an open (lidless) box of length at, breadth and 
height z. I'he volume of the box will be F = xyz^ and its outer area 
A = .vy + 2XZ + 2yz (the area of the base is xy, and there are two 
vertical sides of area xz and two of area y^r). Suppose that the box is 
required to have a given volume F; what dimensions shall we give 
it to make A a minimum ? 

If F is fixed then only two of its three dimensions .v, y, and z can 
be varied independently. In fact, since xyz = F, we have z ~ V/xy, 
and therefore A ~ xy -f 2F/y 4- 2F/.Y, when expressed in terms of .v 
and y only. I'o make A a maximum or minimum we must first solve 
the equations [A,], = [ 5 . 4 /^.v], = o, [AJji = [dA/dy], = o, i.e. 

Vi — 2 F/.v, 2 = o, or y-i = 2 F/.y, 2 
.Vi — zX^lyc = o, or x^ = zV/y^- 

Substitute the value of y^ from the first equation into the second: 

.Y, ^ 2 F/>,= = 2 Vl{ 2 Vix{^y~ = xyjzV. 

One solution of this would be = o: but that is impossible, since it 
would make = I /-ViVi infinite. Since therefore Yj ^ o, we can divide 
both sides of the equation by Yi and obtain i = x^jzVy i.e. Yi — (2F)*. 
Since ^ 2I7 yi-, we have y, = (2F)*, and since z^ = Vjx^'y, we 
have xr, — (ir')h Thus the only stationary value of A corresponds 
to the dimensions x^ = (2F)*, Vj (2I')*, z\ = (iF)^ Ai = (Io8F=)^ 
i.e. a box with square base and of height equal to half its breadth. 
Now for values of x and v near the stationary' point we have approxi¬ 
mately, writing A' = .v — y,, Y = y —yy, 

A A, -f- -b [A,,],XV + 
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But Aj.= y — 2F/a'2, = X ~ 2 .Vjy~, whence A^^ = i, 

Ayy = ^Vjy^. Substituting the values x^ = (2F)*,>', = (2F)* we obtain 
[^a:x]i ^ ^ 2, and 

A^Ai +X 2 +XY + 

The appropriate method for dealing with an expression like 
+ XY 4 - is to “complete the square” in the same way as for 
solving a quadratic equation (Section 3.4). The terms + XY 
which contain X are the same as the first two terms of (X -1- ^Y)- 
= X^ + XY ^Y', and so we can write 

X^ + XY Y^={X *y)2 + |r2. 

But {X + iYy + jg ^he sum of two squares, and is therefore 
positive except when both X + *y = o and y = o, i.e. except when 
X = o and y = o, i.e. at the stationary point itself. It follows that 
X^ + XY + y* is positive at ail points near the stationary point, 

A X- -h XY Y^ > A„ 

and therefore is in fact a minimum, and we have indeed found the 
box of least area. 

(3) y = — 4tu + 

The equations = o, [^Ji = o for a stationary point are 

2/, — 4M, = O, —4/, -f 2Wi = o 
i.e. ti = 2u, = 2ti 

whence /j = 2(2^,) = 4^,, 
i.e. o = 3/,, fi = o, and w, = 2/1 = o. 

The only stationary point is (o, o), and at this point jq = o. 

Now for any point (/, «) near this 

y^yi + +[ytuhTa + ib'uuhfy'-‘ 

where T = t — t, = t, U = u — u, u. But y,t = 2, >*,„ = — 4, 
JVuu = 2, and so the “approximation” becomes 

y ^ — ^tu + «2 

In fact this is our original formula and is exact. 'Fo find out how this 
behaves we again complete the square. The terms containing t are 

— 4«w, which arc the same as the first two terms of {t — zu)^ = 

— 4/M 4 - 4m 2 whence we have 

y = /2 — 4 /M + M* = (/ -- zuY — 3 u 2 . 

This is now the difference between two squares, not their sum. If 
M o then y t- and is positive. But if we choose / and u in such a 
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way as to make t — 2m = o then y = — 3m^ and is negative. Thus 
there are values of y both greater than the value jVi = o and less than 
it, and we have neither a maximum nor a minimum. 

We can look at the problem in this way. Suppose that^ is a function 
of t and w. Then, taking t and u as co-ordinates of points in a plane, 
we can represent y graphically as the height above this plane (Section 
3.7). We shall accordingly get a surface, which will resemble a land¬ 
scape, with hills and valleys. On this surface there will in general be 
three kinds of points at which the ground is level (i.e. y^ = o, y^^ = o). 
A maximum point for jy corresponds to the highest point of a mountain, 
where the ground is momentarily level. But as we move away from 
the peak in any direction the ground begins to slope downwards. The 
deepest point of a valley is also level, and corresponds to a minimum 
in y. But on moving away from it we begin to rise. Finally at a pass 
between two mountain peaks the ground is also level at the head of 
the pass. But there we have a choice; we can ascend to a peak or 
descend into a valley, whichever we wish. The function y = — 

4/w -H w- has such a pass or “saddleback point”. 

Functions of three or more variables are similarly dealt with. 


FURTHER EXAMPLES 
(4) y - t- -i- u- + T~. 

4 ’hc only stationary point is given by [yf]i = L>'u]i — Lvji = 2nd 
is tx = o, Ui ~ o, 7’, = o, Vi — o. Now y is the sum of three squares, 
and is therefore positive except when f = « = ?> = o: it follows that 
Vi is a minimum (both local and absolute). 


(5) V — b - 2u - 1 - 6 v — t- — 2u- — 3t»- — 2tu — 2tz\ 

'To find the stationary point we have to solve the three equations 


[Vj. 

- 0. -- 0, [\\]x - 0, that is 



- 2/, - 2», 

— 2Z' 


2 2tx 4 »] 



6 - 2 fx 

— 6r 

The 

method of solution is as follows. 

The 


~ Uf — zw we substitute this value in the second and third equations 
and obtain 

— 2 — 2//, -{- 2r, - o 
6 




2«, — 4r, ^ o 


'I'he first equation here gives //, — tq i, and on substituting that 
in the i)ther equation we obtain 4 — 2r, = o, i.e. t’, = 2. Thus 
Ux z'x I I and /j - w, -- r, — — 3. The only stationary 
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point is = — 3, Ml = Vi = 2, >'1 = II. On putting T = t — 

U = u — Ml, V = V — Vi we see that near this stationary point 

r—:vi + h[yii\iT^ + + b'tu]iTU 

^[yu^iTV +[y^:\iUV 

= yi — T^ — — 3^2 - zTU - zTV, 

We must again try completing the square. The terms containing 
Tare —— 2TU — zTV. The appropriate square is —{T + U + Vy, 
so that 

—T^~ 2 W—'iV^-zTU~ 2 TV=~{T+U-\-Vy--W~ 2 V^-+ 2 VV 

The remaining terms containing U are therefore — U^ + zUV, which 
are the first two terms of —{U—Vy, so that we can uTite 

— T^— 2 .V^—^V^- 2 TU~ 2 TV=~{T + U + Vy-{U-Vy-V^. 

Now {T^U +Vy, {U—Vy, V- are all positive or zero. Thus the 
expression — {T+U + Vy — {U~Vy — V- is always negative, except 
when T'\-U-^¥=0 and U—V=o and V=o, i.e. except when T= 
U=^V==o: and therefore 

jy - 2f/2 — 3*^2 - 2 TU - zTV < y, 

andjy, is a maximum. 

(6) y = tu + tv uv . 

The stationary point is ti — m, = = o, and the appropriate 

“approximation” is the exact formula y = tu -\- tv uv . It is not 
possible to complete the square in the usual way, since there are no 
square terms. However, we notice that t, m, and v are independent 
variables and can be given any values we like. Let us put v — o , then 
y = tu , and clearly this can be made either positive or negative by 
choosing suitable values of t and u . Thus the stationary point cannot 
be a maximum or a minimum, and must be a “saddleback”. 
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13.1 Geometric series 

In Section 11.4 we discussed “arithmetic series” such as i, 2, 3, 4 
.. or 2, 4, 6, 8 ..or .< 4 , + 2S ... in which each term is got 

from the last by adding on a fixed number or “common difference” 8 . 

A series in which each term is obtained by multiplying the preceding 
one by a fixed number r is called a “geometric series” or “geometric 
progression”, and r is called the “common ratio”. 

Thus suppose a freely multiplying colony contains originally N 
bacteria. If each bacterium doubles itself in half an hour, then after 
i hour there will be zN bacteria, after i hour 2W = 4A^, after i ^ hours 
2W = 8iV, after 2 hours z*N = 16-Y, and so on. These numbers form 
a geometric series. 

i Again consider the metamorphosis of the locust. This passes through 
five larval stages or instars, all much alike, and acquires wings in a final 
sixth stage. The head length increases from stage to stage in the follow¬ 
ing manner (A. J. Duarte, “Growth of the Migratory Locust”, Bull. Ent. 
Res., 29 (1938), 425, quoted by d’Arcy W. Thompson)— 


Table 13.1—Head lengths in locusts 


Stage 

Head length (mm) 

Geometric series 

1 

1 

I 1 

1-44 

1.44 

II 

1 1*94 

1*96 

III 

2*70 

2’66 

IV 

371 

! 362 

V 

489 

4*93 

Adult 

5-59 

670 


The geometric series given for comparison has common ratio 1*36, 
i.e. each term is obtained by multiplying the preceding one by i*36. It 
will be seen that it agrees reasonably well \rith the obser\'ed values, 
except for the adults. A similar result holds for the lengths of the femurs 
of the locust, which increase from one stage to the next in approxi¬ 
mately the ratio i : 1*43. 
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Another example of a geometric series occurs in the survival of the 
lapwing vanellus) (D. Lack, British Birds, 39, 258-264). A num¬ 

ber of these birds were ringed and set free. The numbers of ringed birds 
found dead in successive years are given in the following table— 


Table 13.2—Mortality in Vanellus vanellus 


Year 

Number recovered 

Geometric series 

I 

141 

155-3 

2 

109 

103-5 

3 1 

70 

690 

4 

42 

460 

5 

40 

30-7 

6 

19 

20-5 

7 

19 

13-7 

8 

7 

91 

9 

6 

6-1 

10 

5 

4-1 

11 

6 

2-7 

12 

I 

1-8 

13 

0 

1-2 

14 

I 

•8 

15 

0 

•6 

16 

0 

-4 

»7 

0 

-3 

18 

0 

•2 

Total 

466 

1 

4660 


Each term in the geometric series given for comparison is f of the 
preceding term (rounded off to one place of decimals), and the first term 
of the series is chosen so as to give approximately the same total number, 
466, as the observations. Allowing for the fact that the numbers re¬ 
covered must always be whole numbers, not fractions, and must be 
subject to a certain amount of chance variation, there is quite reason¬ 
able agreement. (A method of testing the goodness of this agreement is 
explained in Section 21.4.) A credible explanation of these figures would 
be that in any year i bird in 3 is liable to be killed. For then the popu¬ 
lation in each successive year would be reduced to | of its value in the 
year immediately preceding, and the number killed would accordingly 
also be reduced in the ratio i : g. 

In general if a is the first term of a geometric series, and r is the 
common ratio, then the second term is ar, the third term ar*, and the 
nth'term 
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PROBLEMS 

Find the first term a, common ratio r, and nth term of the following 
geometric series 

(1) I, -I, -01, ooi, . . . 

(2) 3 6, 2 4, 1*6, . . . 

(3) 32, -24, 18, —13-5. • • • 

Z3.2 Sum of a geometric series 

Sometimes it is useful to have a formula for the sum = a -f* 

+ ar^~^ of the first n terms of a geometric series. 

We find by direct multiplication (see equation 3.7) 

(i - r) (i + r -f + . . , + r"-i) = i — r". 

If r 9^ I we can divide both sides of this equation by (i — r), obtaining 

I + r + + . . . + r”-‘ = (i - r”)/(i - r) . (13.i) 

Multiplying both sides by a we have 

= a + or + ... -h = a{i — - r)] , . 

= «(r" - i)/(r - I)) 

This is the required formula. 


EXAMPLES 

(1) Sum tiie series i 4 - 3 + 3- 4- ... to n terms. 

Here a = i, r — 3, and 

= air- - i)/(r - i) = i(3» - i) 

(2) Sum the series i 4- J 4- i 4- 10 + . . . to n terms. ' 
Here a = r = i, and 

= a{i - r^)i(i - r) = I - (*)" 


PROBLEMS 

(1) A population consists originally of 10 animals introduced into a 
new and favourable habitat. It can be expected to double its size each 
year. How many animals will there be after n years? If each animal 
consumes 1,000 square metres of grain per year, estimate the total area 
of crops eaten in ti years, and in particular the case when n = 10. 

(2) Sum the series i — - i + *01 — -ooi 4- ... to « terms. 

(3) Establish the formula for the sum of a geometric series in the 
following way. Imagine a large population of birds (such as lapwings) 
consisting initially of N birds. Suppose that in each year a proportion r 
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of the total survive, and a proportion k = i — r are killed. Then in the 
first year Nk birds will be killed, and Nr will survive; in the second year, 
of the Nr survivors, Nrk will die and Nrr = Nr^ survive. In the third 
year, Nr^k will die and Nr^ survive. In the first n years, therefore, the 
number killed will be Nk + Nkr + Nkr^ + . . . -f Nkr'^-^ = 
k {N + Nr + Nr^ + - • • + and the number surviving will be 

Nr^. But these ttvo numbers added together must equal the original 
number of birds N. Divide the resulting equation through by yfe = 
I — r, and so obtain a formula for N + Nr + Nr- -f- . . . 4- 


X3.3 Arithmetico-geometric series 

The series ir, 2r-, 3r^ ^r*^ ... is a combination of the arithmetic 
series i, 2, 3, 4, . . . and the geometric series r, r-, . . . . Its mth term 

is mr”*. It is called an “arithmetico-geometric series”. Occasionally \N'e 
need to know the sum of such a series to n terms. It can be obtained by 
the following trick. 

Take equation (13.i) and multiply it throughout by r. It becomes 


r -\-r^ ^r^ + . , . r'^ = {r-r-^^)l{i ~r) . . . (13.3) 

Now this is an identity: it holds for all values of r (except r = i), 
irrespective of whether r is a fixed or variable quantity. So suppose 
that r is variable. Since the left- and right-hand sides of this equation 
are identically equal they must have the same rates of change; that is 

Dr{r + + . . . + r") = Z),[(r - r'*+i)/(i - r)] 

or on applying the rule for differentiation of a quotient we find (after 
some simplification) 

I + 2r + 3^2 + 4rM- . . . + nr"-^ = 

(i — r)2 

On multiplying both sides of this equation by r we obtain 

.++ 3 ^=++...+=’■- ^ .. (.3.4) 

which is the required formula. 

We can go further. We know that the rates of change of both sides 
of equation (13.4) must be equal, i.e. on differentiating with respect to r 

I 4- 2.^r + 32r2 4- 4"''=* + ■ - • + 

_ i_ 4 - r — {n 4- 1 )2 r” 4- {z n- -j- 2n — i)r''+» — 

(I — rf * 

and on multiplying both sides by r 
i^r H- 22r2 4- 32r=' + . . . +„^r^ 

= 4- 2n — i)r'*+2 — «2r'*+2 

(i-rp - 


(13-5) 
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This gives the sum of the first n terms of the series whose general (mth) 
term is which can be considered as an arithmetico-geometric series 
of a more complicated type. A further differentiation enables us to sum 
r + 2^2 + + 4V -h . . . to w terms. We can also combine these 

results to sum series whose »rth term is of the type {^A + Bm -}- Crt^ -{-. .)r”* 
where B, and C are given constants: this is a general '‘arithmetico- 
geometric” series. We merely have to add together A times the sum of 
the series whose with term is r”*, B times the sum of wr"*, and C times 
the sum of mV”* (and so on if there are further terms). From equations 
(13.3), (13.4) and (13.5) the sum to n terms is accordingly 


a’: 


yO+l 


„ r — (w + -h «r"+2 

i-r {i-ry 


^ r — (n 1)- r”+i -f + 2n — i)r"+2 — r"+® 

"T C' -r—-TT--- 


— r^3 


+ 


In the derivation of equations (13.3), (13.4), and (13.5) r has been 
assumed to be a variable. But inasmuch as these equations are true for 
all values of r (except r = i, when the right-hand sides of the equations 
become indeterminate) they are identities; and so the final results will 
still be true even when r is constant. 


PROBLEMS 

Find the sum to n terms of the series 

(1) 2r + 6r2 + izr® + . . . + ?/(m + i)r" 

(2) 6r + 2^r- -I- 6 or 3 + . . . +?/(?* + i) (« + 2)r". 

13.4 The geometric rate of growth 

One of the most striking properties of a geometric series is its rapid 
rate of growth when the common ratio exceeds i. For example, if we 
consider the series i, 2, 4, 8, . . . of powers of 2, then by the time we 
get to the nth term it has become 1024, i.e. more than 1000. In each 
further 10 terms it must again multiply itself a thousandfold: the 21st 
term must be greater than 1,000,000, and the 31st term greater than 
1,000,000,000. We are already approaching astronomical magnitudes 
although each term is no more than double the preceding one. 

In fact any get)metric series with common ratio r > i will ultimately 
grow veiy large, and will eventually exceed any arithmetic series, no 
matter how rapidly the latter increases. Compare for example the 
arithmetic series o, 1000, 2000, 3000, . . . with common difference 1000, 
and the geometric series 1-023, i'047, 1-072, 1-097 • ■ • common 
ratio 1-023 (“ to'®*). At first sight the arithmetic series seems an easy 
winner. Its tenth term is 10,000, whereas the geometric series has only 
got to 1-259 ^^5' tenth term, and must, one would tliink, be feeling 
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pretty small. The hundredth term of the arithmetic series is 100,000, 
while that of the geometric series is only 10. Each time we go on another 
100 terms, we add 100,000 to the arithmetic term and multiply the geo¬ 
metric term by 10: so the 200th terms are respectively 200,000 and 100, 
the 300th terms are 300,000 and looo, the 400th are 400,000 and 10,000. 
The geometric series is now evidently catching up. When we get to the 
thousandth term the arithmetic series has mounted to i million, but the 
geometric term is ten thousand million, and so an easy winner. 

Conversely if r is less than i the terms of a geometric series rapidly 
fade away to vanishing point. The powers of i are (i)' = -25, (})= 

= *0625. iiy = -0156. (i)-* = -0039, = -OOIO, (i)« = - 0002 , (i)' 

= -oooi, and all subsequent powers are zero to 4 places of decimals. If 
we take a value of r nearer i the diminution may not be quite so dram¬ 
atic, but even so it will not usually take very long for the terms to become 
imperceptible. In the data for Vanelliis vanellus quoted above, although 
a bird has apparently 2 chances out of 3 of surviving any one year, not 
one bird has been observed to survive more than 14 years. 


13.5 The sum to iniinity 

Consider the series I’o . . . with common ratio S. 'I’he suc¬ 

cessive sums are = ^, 5 , = J = 5, Sg = i 
*^4 = i + i 4 - i 4 - i\j = 10, and so on. Inspection of these Hgures 
shows that they arc approaching more and mure nearly to i. 'I'he 
general sum to n terms is = i — (i)”, and as n increases the second 
term (i)" soon becomes imperceptible, and the sum S„ practically i. 

Thus _ 

Inn 6,. = I . . (13.6) 

n —• 

It is natural to say that i is the “sum of an infinite number of terms” 
or “sum to infinity” of the series i 4 - ^ 4 - i 4 - lo 4 - 4 . . . and we 

CO 

can write ^ (|)“ = i. 'This form of words is not intended to imply any 

a . 1 

more than that the more terms we take of the series the nearer the sum 
approaches i. 

The sum to n terms of the general scries a -i ar j- ar- -i- ar"^ 4 - . . . 
is = a(i — r")/(i — r). If |r| < i the power r’* rapidly tends to zero 
as n 00, so that S„ approaches the limit ~ a;{i -- r). 


42 4 - ar q- ar^ 4 io infinity - 42/(1 — r) 


(13-7) 


If however \r\ > 1 the successive terms of the series increase indefinitely 
in size, and there is no “sum to infinity”. 


E X A M 1* L E S 

(i) T'hc series t'o + iln 4 - 4 “ . . ., i.e, •! 4 ' 'Oi 4- *001 4- 

•0001 4- . . . has first term a and common ratio r ,V,. Its sum 
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to infinity is therefore tc/(i — tV) — i- This justifies the represen¬ 
tation of the fraction ^ by the infinite decimal • 111111. . , . 

(2) Achilles runs at 10 metres per second, but the Tortoise at only 
•I metre per second. Nevertheless the Tortoise challenges Achilles to a 
race, giving himself 10 metres start. He says “Achilles will take i 
second to cover the 10 metres advantage I have: but by that time I shall 
have gained an additional -i metre. Achilles will take -oi second to 
cover this, but I shall then have gained another 001 metre. Achilles will 
take *0001 second to run this, but I shall still be in front: we can go on 
for an infinite number of steps of this kind, and at every point I shall 
always be a little ahead. So Achilles cannot ever catch me up.” But un¬ 
fortunately for the tortoise the total time involved in this infinite suc¬ 
cession is only i + -oi + ( 01)2 + ( 01)3 + . . . seconds, = 1/(1 — -oi) 
= 100/99 seconds, and is therefore finite. In fact, Achilles will pass 
the tortoise after this interval of 100/99 seconds. We can deduce that 
the infinite decimal l oioioi . . . = 100/99. 

13.6 Convergence and divergence 

Let .v„ .^2, .Vg, a: ,... be any series of numbers, and let — atj + X2 

n 

+ ATg + . . . + .r„ = ^x'a be the sum of the first n terms. If S„ tends to 

a BA I 

a definite limit 5 *, as n tends to 00, then the series is said to be “con¬ 
vergent” and Sgf, is called the “sum to infinity” or simply the “sum” 

00 

^Xa of the series. We also say that the series “converges to S«”. If there 
is no such limit the series is “divergent”. 

Query: Can an arithmetic series be convergent.^ 

Sometimes the solution to a problem can be expressed as a con¬ 
vergent series, but not in any simpler form. This is often quite a satis¬ 
factory solution for numerical calculation. For by summing a sufficient 
number of terms of the series we can get as near as we wish to the true 
answer. But there is one important qualification: the convergence must 
be reasonably rapid. A series may have a sum (to infinity) and yet con¬ 
verge to it so slowly that we have to add thousands of terms to get a 
reasonable approximation. Such a series would be useless for calcu¬ 
lation—for example it can be shown that the series 1 — J-hJ — i + 
5 — ... has the sum In 2, but it would be practically impossible to use 
it in that form. The convergence is too slow. On the other hand a geo¬ 
metric series does converge rapidly, provided that the common ratio r 
is substantially less than i in absolute value. Hence any series which 
tends to its final limit as rapidly as a geometric series will be ideal for 
the purposes of calculation. 
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We shall therefore say that a series a:, -f- .Vg + -scs + • . . is “geo¬ 
metrically convergent” if each term is smaller in absolute value than the 
corresponding term of some convergent geometric series; e.g. if |r[ < i 
then the series rji r^/2 + ... is “geometrically convergent”, 

because each term is smaller in magnitude than the corresponding term 
of the series r -\- . Similarly, if we can find two positive 

numbers A and r such that |r| < i and |.v,„| < Ar^ for all values of m, 
then the series -j- atj + • • . is, by definition, geometrically conver¬ 
gent. 

There are two points to notice in connection with this definition. 
The first is that the property of convergence or divergence of a series 
depends on the “tail end” of the series: what the first few terms do is 
of no importance so long as they are finite. Thus it is convenient to 
widen the definition: 

Formal definition of geometric convergence 

If there are positive numbers A^ r, and N such that 

(i) \r\ < I 

(ii) for every term x„ for which m > N (i.e. for every term from 
onwards) the inequality |Af„, | < Ar”* is satisfied, 

then the series is called “geometrically convergent”. Such a series can 
be conveniently used for numerical calculation. 

The second point is that we are to some extent begging the question 
in calling the series “geometrically convergent”, since it is not immedi¬ 
ately obvious from the definition that such a series necessarily converges, 
i.e. that the sum = atj -1 - .Vj -b . . . + of the first n terms does 
tend to a limit S„ as n -> 00. Now commonsense suggests that this 
must be so. For the essence of “convergence” is that by taking n suffi¬ 
ciently large we can make S„ approximate to S„ as closely as we like; 
i.e. the addition of any further terms to S„ will not alter its value 
appreciably. Now we know that this is true for the geometric series 
Ar -b Ar'^ -b Ar^ Not only do the terms themselves rapidly 

become negligibly small, but also the sum of any number of successive 
terms from the «th onwards is negligibly small when n is large. But if this 
is true for the geometric series Ar + Ar^ + Ar^ + ... it will be true a 
fortiori for the original series .Vj 4- x.^ . ., whose terms are 

even smaller in magnitude, at least from a certain point onwards. So 
the scries Fxa would seem to converge, and so indeed it does: but a 
complete formal proof requires techniques a little beyond the scope of 
this book. 

But, taking this for granted, how can we determine most easily 
whether a given series is convergent or divergent? There are two simple 
and useful tests. 
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(I) Let pjn = Xfn+ilx„ be the ratio of two successive terms of the 
series. Then if there is a positive number r less than i such that all these 
ratios are less in magnitude than r, or at any rate all from a certain 
term onwards, then the series T.Va converges at least as rapidly as a cer¬ 
tain geometric series of common ratio r. For this condition means that 
each term from a certain point onwards is reduced in absolute value by 
at least as much as the ratio r when compared with the preceding term, 
and so the terms decrease at least as rapidly as if each was r times the 
one before it. A formal proof runs as follows. Expressed in symbols 
the condition becomes 


\Pm\ — \Xm-nlXm\ <r < 1 whcn m> N 


(13.8) 


For = |.r„,+i|/|.v„,|. Let us put A = then 

a’yI = Ar^\ For the next term we have 




\Xn\ 


For the (N -\- 2)nd term, 

Xk+ 2 



\pn 


< 

Ar^ r (by 13.8) 

< 





% 


1 


< 


.1 



(by 13 

< 




In the same way I.V.V4.3I = l-Vv+el IP.Y+2I and so we show for 

every successive term from .yy onwards that I.v,„| < Thus the 

series Zxa converges more rapidly than, or at least as rapidly as the geo¬ 
metric series T.-lr'*. 

Examples of the use of this test will be found in the next section. 


(II) Suppose that the ratio p„. = tends to a definite limit L 

as m tends to infinity. I’hen if jLI < i the series £xa is geometrically 
convergent. 

For since ^ L we know that | /3,„| [L|. Now take r to be any 

positive number between \L\ and i. Then since the ratios approach 
\L\ as a limit, and \L\ < r, there must be some value N of m beyond 
which all subsequent |p,„| are less than r. (Otherwise they would not 
be approaching nearer and nearer to |L|.) But this is simply condition 
(I) above for the series to converge geometrically. This is very reason¬ 
able, since the condition + i L simply means that the further 

we go along the series the more nearly it behaves like a geometric series 
of common ratio L. 
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13.7 The Taylor series 

In Section 12.7 we have seen that a function y of a variable t can be 
approximated to by the formula 

y = yi-\- iyt]iT + + .. . + [Dy], 

Here is a fixed value of t and T = t — is supposed small. The 
suffix I signifies that t is to be put equal to after the differentiation 
has been performed. Here we mean that is a first approxi¬ 

mation, valid when the terms containing can be neglected; -f- 
Lvji 1 ' + ^[ytt]i is a second approximation, and so on. In general 
the more terms we take the closer is the approximation, and it is natural 
to conjecture that if we extend the series to an infinite number of terms 
we may get a perfectly exact value. 

3^ = 7i+••• 

= i7[D“y]i7'7|o .... ( 13 - 9 ) 

0 — 


This series is known as the “Taylor series” for the function y. Alter¬ 
native ways of writing it are 




'b . . . 


and 

m 


fit, + r)=/(/,) . {t - 1,) -h i nty) • {t - t,r- +. 


= /('.) +/,(/,) . T + + . . . 


according to whichever notation is convenient. 

'I'hus supposing this series to be valid, examples (i), (2) and (3) of 
Section (12.7) would give 

i/< = I — — *) + (^ — i)* — — i)® -f . . . to infinity 

^ 1 + tj\j_ -b ZV|£ -b + ... to infinity 

amilog t ■--- i + tfM + |2 + t^jM^ |3 + . . . to infinity. 

The first of these series is certainly true if \i — t\ < i, for it is a 

simple geometric series of common ratio (i — t) and first term 1 and 
therefore by (13.7) its sum to infinity is i/[i — (1 — /)] := ijt, "I’his 
being so we may be encouraged to think that (13.9) may be true under 
fairly general conditions. 


AA 
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There is an important special case of (13.9), when = o. Then 
T — t — — ty and the series can be written 


y=At)=yo+ [>,] o< + bu] o< 7 l£. + bm] o< 7 li + ■ • • 



• • 


=/(°) +/'(°) • t +r(o) . «V|2 +/'"(o). /vi3 + • • • 

(1310) 

where here the suffix q means that t must be set equal to o after differ¬ 
entiation. [Sometimes series (13.9) is called “Taylor’s series” and the 
special case (13.10) “Maclaurin’s series”. The two series are, however, 
merely different ways of writing what is essentially the same theorem.] 
Two questions arise which require answering. Suppose we take any 
given function y and expand it in a Taylor series according to equation 
(13.9) or (13.10). Will the series be convergent? And if it is convergent 
is its sum equal to the original function y} 

For the moment we shall take it on trust that if the series has a sum 
(to infinity) that sum is equal to y. But it is interesting and useful to 
find if the series does converge for several simple and important functions 
y. We shall use in every case the form (13.10) with = o. 


EXAMPLES 

(1) j = e\ Here all derivatives yt„ yttt etc. are equal to and 
therefore >‘o = [>’t]o = L>'((]o = . . - = i. The series becomes 

e* = I + ^ + t~l\z 4- /7|3 + • - . to infinity. 

The mth term — i . and Thus p„ = 

for any given value of t this tends to o as w tends 
to infinity. Therefore by condition (II) of the preceding section this 
series converges for all values of t without restriction. 

(2) y' = In (i -{- t). The successive derivatives are y^ — 1/(1 + 

yti = — 1/(1 + 0 ^ ytn == [2/(1 4- t)\ ytttt = ~ [3/(1 + 0 *» and in 

general ~ ( —i)'"-* j w — 1 /(1 + /)”*. Thus when f = o we 

obtain y^ = o, == i, etc. and the 

series becomes 

In (1 +t) = ti\_^ - \±r-i\± + |2 ^713 - . . . 

/ — t-jz -f <73 — <74 4 “ ... (by 12.6) 

The 7«th term is -y,„ = (—i)"*-* and the {m 4 - i)th is — 

( —1)"* <"*+V(ot + i). So p„ = i), and 

= m\t\/{m 4- i) < |<|. If therefore |<| < i this series converges geo¬ 
metrically by condition I of the previous section. 
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(3) y = sin t. Here the successive derivatives are y\ — cos 
yn =■ —sin ty yttt = —cos t, ytttt = sin /, and thereafter cyclically, 
cos ty —sin t, —cos t, sin t. On putting / = o we obtain the values 

^0 = LyJo = [ytt] 0 = o, —1,0,... and so on, and the 

Taylor series is 

y = sint== t ~ ty\2 + t^l [5 — ty\2 + . . . . 

The mth term of this series is , and the 

(m -t- i)*A is ( — 1)'" t^^+y\ 2 fn + I , whence the ratio + = 

—| 2w —i /l zm + i. But 

\ 2tn i = {zm + i) 1 2m = 2m (2w + i)|2m — i, 

so that Pm = —/2/[2m(2m -f i)], and this tends to zero as m tends to 
infinity. Thus by condition II this series converges geometrically for 
all values of t without restriction. 

This series can be used to calculate the value of sin t. But it must 
be remembered that here t is measured in radians. If we want sin 6 '^, we 
must use the relation i® = tt/iSo radians = H radians (say), so that 
= HO radians, and on replacing th'j HO in the series this gives 

sin 0 ^ = HQ — H^ 0 y \2 + H-’O^yi + . . . 


PROBLEMS 

Show that the following series are given by (13.10), and write down 
the mth term of each: 

(1) e~* ^ 1 — t t^l\2. + ty\2 — /Vl4 -f- . • -(for all 0- 

(2) antilog r = I + t/M + [2 + . (for all t). 

(3) cos / = I - ty\ 2 . + /Vl4 — tV\^ 4- - . . (for all t). 

(4) sinh i = t -h /V|3 + 'Vll H' 'V|7 + ■ • ■ (for all t). 

(5) cosh f = I + ty\2 + ty\2 + /V)6 4 - . . . (for all /). 

(6) In -t)= -t- tyz - ty-i - /V4 - . . . (for |/| < 1). 

( 7 ) (i - = I 4 2 / 4- 3^^" + 4'" 4-(for (/( < i). 

13*^ The binomial series 

A specially important series is that fory = (w 4- /)", 

Repeated differentiation of y gives jy, = n{u 4- 0 "“S 3 '« = nin — 1) 
X (m 4 /)'’“*> ym = n(n — i) (« — 2) (« 4- and so on. 
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On putting t = owe therefore find jv 0 “ [y t] 0 — [y«] 0 = 

n{n — i)u”-% |>«(]o = «(« — i) (n — 2) and by (13.10), 

y = (u -h 0” 

= «»+ »»"-•«+ .0 (” - «» + ... 

II |3 

(13-11) 

This is Newton’s “binomial series’*. When does it converge? The mth 
term is 

^ _ n(n-i) (n-z) . . . (n-m+2 ) 

”* |ffl —I 

and the (m + i)th term is 


^m+l 


n{n — i) (n — 2) . . . (n — m -f- 2) (« — m i) 


m 




The ratio Pm — x^+Jx^ is therefore 

n (« —i) («—a) • ■ • («—w4-2) («—OT + i) | ot—I m”-”* ^ 

«(n—i) («— 2 ) ... (tt—m+ 2 ) |OT i”*-* m u 


But as m -»■ 00, (« — w + i)/w ^ — i and therefore —tju. It 

follows by condition II above (Section 13.6) that if |—</«! < i, i.e. if 
\t\l\u\ < I, i.e. if 1^1 < |«|, the series converges geometrically, irres¬ 
pective of the value of n. For example, if we put n = —i we get 

i/(m + i) = «“* — w"* t -h ^^ ^2 _ ^ ^ ^ 

II 

= ~ / -h — u~* -f . . . 


provided that |^| < |t<|. This is in fact a geometric series with com¬ 
mon ratio r = t so that |r| = UI/|m| < i- Again if we put \ 
we have a series for the square root 


V(« +<) = “* + «■’ ^) a-J /3 4. . . . 

= v«[i + K</«) -T(<v«-) + ■ ■ •] 

In the special case when « is a positive integer or zero the series is 
simplified. We know that the mth term is 


Xm = ”(« — i) (n — 2). . . (n — m -I- 2 )m"“"‘+^ ^ 

and if tn > w + 2 this must contain a factor (n — n) ~ o, and so must 
be zero. That is, the series becomes a finite one of (n -f- i) terms only, 
all subsequent terms vanishing, and therefore it converges for all values 
of u and t without restriction. 
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For example: 

{u +tY= X 

(u ty = u + t 

{u + ty = U- + zut + 

{u + ty = t /3 + + 3M^2 _|_ iZ 

(u + ty = + 6 uH^ 4- -f 

(w H- Z)5 = 4- 5|/4f 4- lOu^fZ _|_ jou^fZ -^^^4 _|_ fS 

and so on. 

The coefficients in these powers have a number of simple properties. 
Before we discuss them, however, it is helpful to introduce a distinctive 
notation. Unfortunately there are a number of different symbols in 
use, and no general agreement on notation. But the simplest one seems 
to be due to H. E. Soper, and runs as follows. 

The expressions ti, ?;(« — i), w(h — i) (« _ 2), and so on, which 
occur in the binomial series bear a distinct resemblance to powers. The 
ordinary power /i- means n X n, the product of two equal factors: 
whereas n x (n — i) is the product of two factors in which the second 
is diminished by i compared with the first. Similarly w® = n x n X «, 
with 3 equal factors, whereas in « X (« — 1) x (// — 2) the factors 
decrease by i at each step. We call an expression like ;/(w — i) (« — 2) 
a “descending factorial”, and to emphasize its analogy with the ordinary 
power we shall write it as (w —y. I'hus (n —(«_)2 = 

— i). {n —= n(n — 1) (n — 2) and in general 

(n — y = n{n — i) (« — 2) . . . — r 4- i) - (13.12) 

[Other symbols often used for (;/—)*' are "P,., (n),, and n^ry] 

We can therefore write the binomial series as 


(u + ty = u" 





(«-) 


u 


N.2 


P H- 


Wc can also clearly have an “ascending factorial” (« !-)'■ where the 
factors rise by i at each step: 


(w4 y^n(n 4 - 1). 

(w +)3 = ti(n 4 - i) (« + 2), 

(«+)'* = «(« + r) (n 4- 2) (// 4- 3) and in general 
(h4-)'' ^ „(„ 4 i) (« + 2) . . . (« + r — i) . (13.13) 

Such an ascending factorial occurs in the expansion 


(w — /)-" = u-"-i / 4- ^ H-"- 


4 - . . . 


obtained from (13.11) by writing -« in place of t and «. The 
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ascending factorial also occurs in the theory of generalized arithmetic 
series (Section 11.6), where for example 

(1+)"* + (2+)”* + (3+)"* + . - . + (k+)^ = + i) 

Another way of writing («+)*’ is 

Other important expressions, frequently used in higher mathematics, 
are (w—) 7 [^, w 7 |^ and {it +)Vl^« Convenient symbols for these are 

(n —)r, («)r, and {n +), respectively, and they are named ^‘reduced 
descending factorials”, “reduced powers”, and “reduced ascending 
factorials”. Thus, by definition, 

(n—)r = {n-yi\r = n(n — i) (n — 2). . . (w — r + i)/|^ ] 

(«)^ = nVK = n .n .n . . . n/\^ y (i3-i4) 

(n+)r = «(« + i) (« + 2) . . . (h + r — i)/|r j 


Other symbols for (h—) r are "C^, ”(r)> other symbols 

for («+)r are "i/r and H[r]. The alternative symbols and for 


the reduced descending factorial {n —)r are of fairly common occurrence, 
and should be specially noted. 

The binomial series can therefore also be written in the general form, 
valid for any value of w, 


{u -h /)" = m” + («—)i t + (« —)a H- . . . 

We have not yet assigned any meaning to the symbols (n —)® and 
{n 4-)®. However («—)'’ and («+)'’ obey the recurrence relations 

(«-r+i = (n-r)(«-r 

(«+)-+! = („ +r) («+)’■ 

for all values of r greater than zero. If we define («—)®, (« 4 -)° in such a 
way as to make these true even when r = o, we find {n —= n{n —)®, 
(h+)* = «(«+)”: but («—)’~ («4-)i w, so that («—)® = (n+)° 
— I. Since {0=1 we therefore have 

(„_)o (,/)o = („+)o („_)(, = („)^ = (n+)o = I. 

Using this convention we can compress the binomial series into the form 

(h 4- ^ 2 (« ~)a - . (13*15) 

a = o 

and the series for e‘ in the form 

= (Oo + ( 0 i + (O2 + (O3 + - . . = (Oa 

0 

'fhe values of (w—)r are given for positive integers n in the following 
table, known as “Pascal’s triangle”. 
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Table 13.3—Pascal*s triangle 
Values of the binomial coefficients 
(«— )r = «(« — 1) (« — 2) . . . (n — r + i)/[r 
r = o 1 2 3 4 5 6 7 8 9 10 

n — o I 

1 II 

2 I 2 I 

3 I 3 3 I 

4 z 4 6 4 I 

5 I 5 10 10 5 I 

6 1615 20 15 6 I 

7 I 7 21 35 35 21 7 I 

8 I 8 28 56 70 56 28 8 I 

9 I 9 36 84 126 126 84 36 9 1 

10 I 10 45 120 210 252 210 120 45 lO I 


From each row of this table we can read off at once the corresponding 
binomial series. Thus taking the row n — 6 we have 

(u + ty = w® + + 15MV" 4- 20K^/® + 

The most striking property of this table is that each row is sym¬ 
metrical: it reads exactly the same starting from the end and working 
backwards. In symbols this means that 

{n—)^-r ^ {n~)r ■ ■ . ( 1316 ) 

This property is a consequence of the identity (w -f- /)” = (f + u)"\ 

3“*^ + 3*^^* -f- = (/ + = r® 4- 

4-1/®: we reverse the series but have the same coefficients. It can also 
be proved as follows: 


If n is an integer then 

[n—y = n{n —- i) (« — 2) . . . (« — r 4- i) (by definition) 

= — 0 • • ( ” — r i)(n — r)( n —r — i) ... 2.1 

{n — T){n — r — i) . . .2.1 


whence 


= \nl\n - r 


(n-). = («-) 7 |r = 


From this formula 


(1317) 
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By the use of equation (13.17) the binomial series can be expressed in 
another rather elegant form. We have 

(u -f ty = + (w—)i + (n—>2 . + r 

|« \n \n \n 

= ^ —!=— ^ . . . + -!=- uH^ 

\n |o \n—i |i ) w—2 [2 10 !« 

Divide this equation throughout by \n. It becomes 


{u -I- ty M" /o 


71 


71 lO 


+ 


U 


n-1 


W — I I I 


+ r 


M"-2 ^2 
« — 2 12 


+ . . . + 


o \n 


or, on using “reduced powers” (?/),. = 

(« + t )„ = («)„ (/)„ + («)„_, ( 0 . + (h)»- 2 (/)2 + . . . + («)o (13-18) 

provided that n is an integer. 

The other propert)^ of Pascal’s triangle which is worth noting is that 
each number in it is obtained by adding the number immediately above 
it to the number immediately above it and one place to the left. (This 
property enables us to construct the triangle very quickly.') That is. in 
symbolic form, 

{71 + i — )r = {n—y H- (?,—. (13.19) 

The proof runs as follows. By (13.17) 

1« 


(«-)r + («-),_! = 


\L\!L^ 


+ 


n 


r — I in — r-f-i 


We now use the relations I r 
Thus 


+ I =: (n — r + i)\n-r. 


(«-)r + (w-)r-l === 


n 


---{- 


n 


rl r — I I n — r 


I'L (l 

1 

I 

r — I 1 n — r \r 

n — 

r + 

1” 

n + I 


1 r — I 1 « — r r(n 

-r + 

0 

^ _ (« +1) 

1" 



r — I (n — r + i) |n — r 


r\ r ~ i . (« — r i) |n — r 
!« + I 
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PROBLEMS 

(1) Sho\vthat(— n —)’^ = ( — i)*" («+)’' 

(2) Show that, if n is a positive integer, (w-}-)'■ = |« + r — i/|« — i 

(3) Show that each number in Pascal’s triangle is the sum of all the 
numbers above it in the column immediately to the left. 

13.9 Correctness of the Taylor series 

We now return to the problem of determining whether the sum of 
the Taylor series for a functionjv is in fact equal toy, as it should be. 

For certain functions y the series cannot converge. The function 
y = ijt cannot have a Taylor series expansion about / , = o, for the first 
term of the series, = i/o is infinite. The same applies to In t. There 
can be no series for y = \''t about = o, for when / = o the first 
derivative y, = 1/2\/? is infinite. Similarly |/[ cannot be expressed by 
such a series, for it has no derivative when f = o. 

However when the series converges it is almost always true that its 
sum is equal toy. This is a consequence of a very general theorem (see 
Section 14.25). But in many simple cases it is possible to give a direct 
proof. We make use of the more precise form of the approximation 
given in Section 12.9 (the “hare, tortoise, and snail” argument) which 
shows that y must lie in value between 

iyi +[>(]i ^ +[jn]i TV|2 + . . . -t-[/),"y],n,>, TVl^w) 

and(yi +[y,]i T +[y((]i TV|_2 -| . . . 4 [/>,"y]r„<,;r T"/|h) 

where as usual T = t — and [/)<’'>],„.>• and [/)<"y],naa. are respectively 
the least and greatest values of D"y in the range of values we are con¬ 
sidering. We can express this fact in the form 

y = 

where =yi +[y/]i T + ■ - • + “ * = the sum of 

the first n terms of the Taylor scries fory, and R„ lies between 

[Dty],nin 7 ' 7 |«_ and [/)("y]™a* 

Now what we want to show is that y is the sum to infinity of the series, 
which by definition means that ->y as n —> 00, i.e. since R„ = y — S^, 
that R„ o. 

Let l/)|"yUa* denote the greatest absolute value of De”y; both 
Pi'y^mtn and cannot exceed this in absolute value, and so 

^t'yXnir, ^"/|" and [/),"y]m«^ cannot exceed \Dt’'y\„,ax |T[7|^in 

absolute value. Therefore R„, which lies between these values, must 
satisfy 

|/e„| < |/)i"yUa*: ITI'ViM. 
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So whenever we can prove that 

-> O as n->00(13.19) 

it follows that R„ -> o, i.e. y — ^ o, i.e. i.e. that the series 

has in fact the correct sum y. 

This condition looks somewhat formidable, but in practice is usually 
not too difficult to apply. Only a few simple cases will be considered 
here: others can be covered by very similar arguments. 

(i) The series i + f for e*, considered for values of 

t lying benveen — i and i. ^ 

Here y = e\ and = o, so that T = t. Also we have Dt^y = e\ 
and lies between ije and e for values of t between — i and i. Thus 
\^t'y\max = e. So in order to show that the series i + f + ^7j2 + . . . 

does in fact converge to the correct value y ~ e* we have by (13.19) to 
show that e\t\^l\n -> o as « —> co. But this is clearly the case, since 

|/| < I, and therefore e\t\"l\ 7 i < el\ii which -> o as n -> 00. 

A slightly more complicated version of this argument can be used 
to show that the series sums to for values of / lying between —2 and 2, 
or between —3 and 3, and so on, finally including any desired value of t. 

(ii) The series In (i -f- ^) = / — t^jz + ^^3 for values of t lying 
between o and i. 

If >- = In (i +/) then = (-i)"-^ |« - i (i + 0 ""- The 

smallest value of t is o, so the smallest value of (i + i) is i, and the 
greatest value of (i + /)-" is i-” = i. Therefore = \ n - i . 

To prove that the series has the correct sum we must show by (13.19) 
that 

\ n — 1 1^17 o 

as H CO. But |/| < I, so that < i/«t which certainly tends to 

zero, and a fortiori |/|7« must also tend to zero. 

(iii) The series for 1/(1 — t). This turns out to be i + t t- + 

+ . . i.e. a geometric series, and we have shown above that if |;| < i 
this has the correct sum 1/(1 — t). 

In such a way we can verify, though rather clumsily, that the Taylor 
series for a particular function does in fact have 3/ as its sum to infinity. 
But as we have said there is a more advanced theorem which shows that 
this must be true for all cases which arise in practice. 

13.10 Manipulations with series 

We can find the series for cosh t and sinh t in the following way. For 
all values of t we have 

e* ^ I 4. / 4. ^2/p _i_ r7|^+ _ 
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Replace t by (— f) throughout to obtain 

e-* = \ — t - • - - 

Now by definition cosh t is + e-*), and sinh t is Let 

us therefore first add the series for e* and e~* by adding together the 
constant terms, adding the terms containing and so on. We find 
e* + e~^ = 2 + oi -h 2^V|2 + oi^ + 2/^/|4 +. . . . Finally on division 
by 2 we get 

cosh t — I + /Vp + ^V|4 + ■ • ■ * 

In the same way a subtraction of the series for from that for e* and 
a division by 2 gives 

sinh t = t + |3 + 11 + - • • - 

But are we justified in treating the series in this way? If the aeries 
were finite we should be merely following the ordinary laws of algebra 
in such a calculation. But here we are dealing with infinite series, and 
what is referred to as the “sum” of such a series is strictly speaking not 
an ordinary sum, but the limit of a sum. So a little caution is required. 
However, it is not difficult to show that our operations here are indeed 
justifiable and logical. We should expect them to be so, since the con¬ 
vergence of the series means that all terms are negligible after a certain 
point, so that we can almost treat the series as if they had only a finite 
number of terms, neglecting the rest. This can be expressed symbolically 
as follows. Let be the sum of the first m terms of the series for e*, 
and S'fn the sum of m terms for e~*. Then if we include the zero terms 
in the series we have obtained, which we hope has the sum cosh /, and 
write it as 

(presumed cosh f) = i -}- 0/ + + ^V[4 + • (*3'2o) 

the sum of the first m terms of this series is S",„ = ^(S,„ -f- S'^), be¬ 
cause of the way it has been obtained. But S,„ —> as m co, since 
the first series has sum e* to infinity, and —> e-\ and so 5 ",^ = 

i(iS„ 4- S'„), which tends in the limit to e-‘) = cosh /, i.e. the 

series (13.20) does in fact sum to cosh t. In the same way the sum S”'^ 
of the first m terms of the series 

o + t + ot^ + t^/\^ 4- + . . . . (13.21) 

— S'„), and so tends to — e~*) = sinh t as m > 00. Thus 
(13.21) has the sum sinh t. 

A slightly more complicated argument shows that we can also multi¬ 
ply series. For example, suppose we wanted to calculate, for |/| < i, 

(I = -0-Mi -t)-^ 

= (I 4 - / + /M- 4 - . . .) (I 4 - r 4 - 4 - 4 - . . .) 
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Multiplying the term “i” in the first series by the “i” of the second 
gives us the constant term “i” in the product. A term containing t can 
arise by multiplication in two ways, either by multiplying i in the first 
series by t in the second, or by multiplying t in the first series by i in 
the second; this gives us i . ^ f. i = in the product series for 
(i — /)-2. There will be three products containing viz. 1 t 
. I, giving 3<“ in the (i — series. Thus by multiplying the series 
in this way we find (i — = i -{- 2/ 3 ^^ + 4^® + • • • and that is 

correct, for it is precisely the expansion we get using the binomial series. 

We can prove this result by a different argument. Write (i — 

= i and differentiate it, using the rule for 

differentiation of a sum. 

Z)((i — = Z)(i + Dtt + 4“ . . . 

This gives once more 

(i — /)-2 = I -I- 2^ 4 - 3^2 4^3 + . . . 

Again let us take the geometric series 

(l + ^2)-l ^ I _ ^2 ^4 _ ^6 4_ _ , 

and integrate it, using the rule for integration of a sum. We find 

f (i -)- dt=^ J idt — J t-dt 4- f t*dt — J t^dt . 

that is 

tan-* t — C + t ~ 4- — <7? 4- . . . (13*22) 

Putting / = o shows that C = o. 

All these operations are what one would expect to be able to do with 
series. It can be shown that they are valid for all, or almost all, series 
one is likely to meet with in practice, and in particular they are valid for 
all geometrically convergent Taylor series. For an exact discussion the 
reader is referred to books on Analysis (the theory of real and complex 
functions). 


PROBLEMS 

(1) Multiply the series e *. e-* and show that the product is i. 

(2) Taking the series t — ^V [3 4 - ^V !5 — • • • sin t, obtain the 
series for cos t by differentiation. 

(3) Multiply the series (i — ^)3 and (i — t)-^ and show that the 
product is i. 

(4) By integrating the series for (1 4 - 0 ”^ obtain the series for 
In (i -I- t). 
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13.II Uniqueness of Taylor series 

We can now show that a function y oi t can be expressed as a Taylor 
series in powers of {t — in only one way. For convenience of 
demonstration we shall take the case = o. Suppose then that we find 
two possible series for y, say 

y ^ ao + + . . . = -j- + b^t- + . . . . (13.23) 

By putting t = o we see at once that = b^. Now differentiate (13.23): 
we obtain 

yt 20 ... b^ -f- 2b nt -|- . . . 

If we put t = o we see that Oi = b^. Again differentiate: 

yit = 202 + 603/ + .... = 2^2 + 663? 4- . . . 

whence on putting ^ = o, 202= 2b 2, i.e. 02=- b 2- Proceeding in this 
way we see that = b,n for all rn, i.e. the series are identical. 

As an example of the use of this result consider the identity (i + t)’*' X 
(i 4- /)" = (i -f- /)"»+". Provided that |/| < i we can use the binomial 
series 

(i 4 - t)^ = (ffi—)o 4 - (w—)if 4 - (m—)2<- + (m—) 3(3 + . . , 

(1 4- 0 ” = )o + in—)it 4 - (h —)2^- 4 - («—) 3 '^ + • - ■ 
whence on multiplication 

(i 4 - t)^ (i 4 - 0 " = (w—)o («—)o + [(w —), (h —)o 4 - (w—)o (»—)i]^ 
+ [('«—)2 («—)o 4 -(w —)i(«—)i 4 -(/«—)o (« — 

+ [(w—)3(« —)o +("* —)2(« —)l +('«-)l («~)2 4 - ('«—)o(«—)3]''‘ 

4 “ . • . 

whereas by the binomial series 

(i 4 - 0 ”''*'"=(?« 4 -n—)o+("*+«—)i^+('"+””")2^^"b('«+"~)3^’4- • - - 

These series must be identical, so we see that 

(ot 4 -m—) o=(ffi —)o (« —)o 

(m 4 -n—), = (m —)i («—)o 4 -(w—)«(« —)i 

(m 4 -«—)2=(m—)2 («—)o 4 -(w —)i(w —), 4 - (w—)„(« —)2 

('" 4 -n—)a=(m*-) 3 («--)„ 4 -(m —) 2 («“)i 4 -(»«—),(« —)2+(m—)□(« —)3 
and in general for any positive integer r 

(m 4 -n—)r=(m—)r(«—)o 4 -(m —),._,(« — ), + ... 4 (»* —)«(« —)r(i 3 - 24 ) 

This is known as van dcr Monde’s theorem. 

Similarly by multiplying out (i — /)“'" (i — = (* — /)-«-»» 

obtain (m 4-«4 )r = (^ 4 -)r (" 4 -)© 4 -(w 4 )r-i (« 4 -)i 4 - . . . 

-1- (w-l-)o(« 4 -)r (13-25) 

and for comparison the binomial series can be written (equation 13.18) 

(OT 4 -«)r = (m)^ («)o 4 - («)i 4 . ... 4 - ('«)o ('Or- 

All these relations are true for any values of tn and w. 
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PROBLEM 

(i) What relations does one get by expressing in 

series form? 

13.12 The solution of differential equations by series 

Suppose we wish to solve the differential equation yt = Ky. Let us 
suppose that the solution y has a Taylor series pa pxt + + 

p^t^ . where the p^^ are numbers we wish to determine. Then 

yt^Pi + + ZPzt^ + 4^4^^ + . . . . 

Ky = Kpa + Kp^t + Kp^t^ + Kp^t^ +- 

and the equation yt — Ky shows that these two series must be identical, 
i.e. 

Pi — Kp 0 

2^)2 = Kpi, that is, p2 = Kpjz = K^PqI\^ 

3/^3 = Kp.,, that is, ps = KpJz = K^pal\^^ 

= Kp^, that is, />4 = -^>3/4 = K^p^l |£ 

The solution is therefore 

y — p Q P\^ d” p 2 i“ H~ + . .. . 

= /)o + Kpat + K-paP^j\l^ + K^pat^l\Z + • • • 

^Pa^^ 

This solution introduces one new constant pa, as indeed it should, 
since we know that the solution of a first-order differential equation 
contains an arbitrary constant. 

This process can be applied to almost any differential equation. It 
will be successful except when there are solutions which do not have 
such a Taylor series (e.g. solutions like 3' = ijt or y = \/t). 


EXAMPLES 

(i) Solve the second-order equation j’tt = —co"y, where co is a con¬ 
stant. 

Let y pa + Pi^ d- Pit- etc. Then y^t = i.^.pn. d- 2.i,.pzt d- 
3.4./).(i- -i- . . . . and — w-y — —cj-pa — i^-p\t —ajrp^t^ — ... These 

series must be identical: 



1 

f 

that 

is, p., 



2-3 

^ -<^-pi. 

that 

is, />3 

■~^~Py\Z 


3-4 ^ 

■- —<^-pi. 

that 

is, p , -- 

of-po/ST 

W^p 0 /14 

4-5 /"o 


that 

•s. /’g — 

—W-P3/4.5 

= 15 



that 

is, Pg = 


= -t«®po/i6 

and y - 

^Po- Pi 

it -h. 

p.r- -F . 

« • • 
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— Po Pit — 


= />o (^i - 


^-Pgt 

li 


cu 


^Plt^ 




OJ 


+ — - 


11 

a;8/6 


+ ^"PyL" 


+ 


II 

) 






|5 

- ^toZ — 


OJ 




+ 


OJ 


5/3 


|3, |5 


- ) 


11 li 

~ Po cos oit + (/>i/c«) sin cut. 

This contains two new constants, p^ andas we would expect, and 
IS the general solution. 

(2) The equation + ty^ + — n^~)y = © is known as “Bessel’s 

^uation . If n is an integer then there is a Taylor series solution y = 

^Jn{t) where C is an arbitrary constant and known as “Bessel’s 

lunction of order is given by 


J„(/) = 




+a 


+ 




\1 I5I1±1 I?ll±f 1311+3 


+ 


(13.26) 


This cannot, however, be the complete solution, since it only contains 
one arbitrary constant, whereas the original equation involves the second 
derivative y,t and so is of the second order. It follows that there must 
be another solution Y„(i) to the equation which has no Taylor series 
about = o. Such a solution can be found, but has a very complicated 
form (see E. T. Whittaker and G. N. Watson, A course of modern 
analysts, Cambridge U.P., 4th edn, 1935). A biological application of 
these functions to population spread is given by J. G. Skellam, “Random 
dispersal in theoretical populations”, Biometrika, 38 (1951), 196. 

* 3 * *3 Multiple Taylor series 

The results of Section 12.11 suggest that a function y = /(/, M)of two 
variables can be expressed by an infinite series 

>=M 1 +5',].r+oj.iy++ fz>Ji, 

|2 |0 |,_|^ |0 |2_ 

the general term being ^vhere 

\P \‘L 

~ ^i)» ^ means (u — u,) and the suffix , means that we must put 
f — ti and M = w, after the differentiations have been performed. This 
IS generally true, provided that the series converges. We can also extend 

; 3 or more variables. The most important application 

IS the multinomial theorem” which enables us to convert an expression 
like {t + u -h V + uiy* into series form. This again is really only needed 

»n practice when « is a positive integer. We then get a finite series valid 
tor all values of t, u, v, and w, 

/•# I . . V \n t^^ut^v/zv^ 

\t u V w) = 2 ■ • • 

a4-/y+y-f6*« I® 1^ l>: 


etc. 


as usual T means 


(* 3 - 27 ) 
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We take all possible combinations of integers a>o, / 3 >o, y>o and 
8 > o such that + obtain one term 

^n.t^u^vytu^l |a 1^ |y IS 

for each such combination, and add all these terms together. 


Z3.14 The sigma notation for double sums 

Suppose we have 8 numbers arranged in a rectangle as follows 


ATll 

^12 

X 12 


^21 

X 22 

X 29 

^24 


(13.28) 


Such an arrangement could for example occur if we had samples from 
two dilferent nationalities, and had classified them into their blood 
groups O, Ay By or AB — 


Table 13.4—Blood group classification 


Nationality 

Numbers in blood groups 

0 

A 

B 

AB 

Total 

British 

•*11— 

108 

Xi 2 =^ 

III 

Xi 2 ~ 

22 

Xn~ 

9 

250 

Latvians .. 

^21 = 

55 

^22 — 

69 

^*23 = 

33 

*24“ 

IS 

i 

172 

Total 

1 

1 

163 

180 

55 

24 

422 


These numbers are taken for illustrative purposes from Ruth Sanger and R. R. Rac^ 
“Blood Groups in a Sample of 250 People”, Ann. Eugen. Loud., 15 (i 949 )» 77 > 

R. R. Race, Ruth Sanger, Sylvia D. Lawler and D. V. Keetch, “Blood Groups of 

Latvians”, Ami. Eugen. Loud., 14(1948), 134. 


Such a systematic arrangement of numbers in a rectangular array is 
known as a “matrix”. Here we have (ignoring the totals) an array of 
two rows and four columns: it is called a “2 X 4 matrix”. The use of 
two suffixes Xfg provides a systematic way of writing the numbers in the 
array. The first suffix r refers to the row, i.e. the nationality, i denoting 
British and 2 Latvian, while the second suffix s refers to the column, i.e. 
the blood group, i denoting group O, 2 A, 2 ^ and 4 AB. Thus X13 
means “the number of British persons of blood group B’\ ^ 

Now we can add all the numbers a;,.,, obtaining the “grand total’ 
•■^11 + -Via + A’ls .Vj4 -h X21 + .V22 + a *23 + .^24 = 422, as shown in 
the bottom right-hand corner of Table 13.4. This can be written as BXap 
or as Bxap meaning the sum of all the numbers av,. 
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Table 13.4 also gives “row totals*’ (right-hand column) such as 

T 2 + ^13 + ^14 = ^ Xip, meaning the total number of persons 
ot British nationality, and H- -h + x., = = the total 

number of Latvians. Addition of these again gives us the grand total 

+ 2 = 2 * JCaiS, (i.e. 250 -h 172 = 422). In 2 notation this can 
be written even more concisely: 


^{^Xap) — 2 A-’a^ . . . (13.29) 

« iS a, ^ 

^e. if we first add all the Xa^ for a given value of a but different values of 
P we get the total 2 Xafi of row a; a further summation of all these row 

totals gives the grand total 2.Va^. 

In the same way we can find column totals, x.^i = 2 .v«,, the 

total m^ber of group O, a-, 2 -f Aj.. = 2a„ 2. the total group A, 2 a,, 3, 
group B, and 2 Xa^, group AD. 'I'hese column totals are shown at tlie 
root of Table 13.4. The sum of these column totals must be the grand 
total; 163 -f- 180 + 55 -f- 24 = 422, or in general 


2 (2 Xa^) = 2Aa^ 
fi a n,fi 


^^tis it does not matter whether we first sum the columns, and then 
add the column totals together, or first sum the rows, and then add the 
row totals together. 

This discussion can readily be generalized to three-way, four-way, 
niore complicated classifications. Suppose we add a tliird 
such as a i for male sex, and 2 for female. 'Then A2,, would mean 
me number of persons of Latvian nationality, group AD, and male sex. 

•3^041 would be the total number of any nationality of group AD and 


male sex, and 2 a„^j = 2 (2 a,,^,) = 2(2 .v,,^,) would be the total num- 

ft P n 

her of males. The grand total would bc 2 .v„^y, and the summation can 
be done in any order, e.g. 2[2(2A«^y)l. or 2(2.v„^y). 

a fi y ft a,Y 

Another point worth noting is the product of two sums, such as 
^2-^X2 + A4)(jyj 4 -J2)- Hy direct multiplication this becomes 

(Xiyy + Ajy, -p A^J d- A,y, + A,>'2 T A 2^ 2 + X 2 + 

^J^o^yft summed over all possible combinations of suffixes a and jS. 

1 hus we can write 


(2 Aa) (2 yft) = 2 Xnyft 


\ 


It would be incorrect to write (2Aa)(2ya) = 
mean for instance (a, + a* -h Aj -Pa,) {y^ 
wnich in general is not true. 




2 Aaj>’«, for this would 
+ >’a) = (A',Ji -i-A:2y2) 
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Similar reasoning shows that Xa) (^^ypy) — 2 Xay^yt and that 
(2 Xa) i^yp) Zy) ~ ^Xay^Zyy and other formulas of this land. Such 
formulas are often useful, especially in the study of matrices (Chapter i8). 

These results can equally well be applied to infinite series, i.e. series 
such as 2 Xafi where a can take all values i, 2, 3, 4, . . . and so on indefin¬ 
itely, and similarly ^ can take all values i, 2, 3. . . . They will be valid 
if the series is “geometrically convergent”, that is, if we can find two 
positive numbers A and R with the properties (i) R < 1, (ii) 

< AR^'^’ for all values of r and s. However, such double infinite series 
are not very frequently needed in practice. 



CHAPTER FOURTEEN 


DIRECTED MAGNITUDES 


14.x Vectors and scalars 

Forces, velocities, electric fields, and electric currents all have this in 
common, that they have a direction as well as a magnitude. If a body is 
being acted upon by a force it is important to know not only what the 
magnitude of the force is, but also whether it is urging the body to 
move north, south, east, west, upwards, or downwards. A physical 
quantity which has direction as well as magnitude is known as a “vec¬ 
tor’* (or more precisely a “spatial vector”). In modern textbooks it is 
usually distinguished by heav)' type, e.g. v for a velocity or F for a force. 
A quantity which has only magnitude, such as temperature, volume, 
mass, or electric charge, is known as a “scalar”. 

Now we usually consider two measurements of length or mass as 
being “equal” if they have the same magnitude regardless of position, 
time, or other external conditions. A mass of i kilogram in London is 
regarded as equal to a mass of i kilogram in Paris, whether measured in 
the year 1900 or 1950: that is the normal way of using the word 
“equal”. In the same way we shall regard two forces or other vectors 
as “equal” if they have the same magnitudes and the same direction, 
irrespective of where and when we measure them. It is convenient to 
speak in that way. But if they are equal in magnitude but different in 
direction we shall regard them as unequal forces. There is one excep¬ 
tion to this. If the magnitude of a vector is zero, its direction is of no 
importance. A body acted upon by zero force is not urged one way or 
another, and it is futile to ask what direction the zero force is acting in. 
A body which is standing still can be imagined is moving with zero 
velocity in any direction. A vector whose magnitude is zero we shall call 
a “zero vector” O, and regard its direction as arbitrary or undetermined. 

The word “vector” is Latin for “carrier”, the point being that any¬ 
thing which is carried must be carried a certain distance in a certain 
direction. The word “scalar” is related to “scale”, since two scalar 
quantities can be compared on a single scale of magnitudes without 
reference to direction. 

14.2 Geometric representation of a vector 

From the definition of a vector as a quantity with magnitude and 
direction we see that it can be represented by a directed line AB^ the direc¬ 
tion of the line (from A to B') corresponding to the direction of the vector, 
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and the length of the line being equal to the magnitude of the vector 
(in suitable units) (Fig. 14.1). Any point A can be chosen as starting- 


1 



point for the line. If A' is any other point, and the line equal in 
length and parallel to AB, then u 4 'B' will equally well represent the 
same vector. However, suppose we choose arbitrarily any one fixed 
point or “origin” O in space (Fig. 14.i). Then we can draw a line OP 
equal and parallel to AB, and representing the same vector v. So for 
each vector v there will be just one such line, and just one such end-point 
P. Conversely for each point P there will be just one vector OP. In this 
sense we can speak of the point P as “representing the vector v = OP”. 
The advantage is that in a complicated diagram it is easier to draw a 
single point P to represent a vector, rather than a line OP or AB. The 
representation of a vector by a line is the more natural one, and we shall 
use it wherever possible. I'he representation by a point P has the dis¬ 
advantage that it is necessary to choose a fixed point O as origin, and 
such a choice is arbitrary. For when we say that the vector v is repre¬ 
sented by a point P, we really mean that it is represented by the line OP. 
But the use of a point P in this way is sometimes helpful. 

Note that the zero vector O is represented by any line AA of zero 
length, or by the point O. 

14.3 Components of vectors 

Let AB (Fig. 14.2) be a directed line representing a vector v, and let 
HK be another line in a direction D making an angle 6 with AB. Draw 



Fig. 14.2—The component UV of a vector AB along a line HK 
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AU, BV perpendicularly onto HK. Then the distance W is called the 
“component of v along HK” and is equal to the magnitude of v times 
the cosine of 0 (Section 5.6). Here UV is supposed to be measured with 
its proper sign, positive if the direction from U to V is the same as that 
from H to AT, and negative if it is the opposite direction. This agrees 
with the usual definition of the component of a force in mechanics. 

14.4 Magnification of a vector 

Let V be any vector, such as a velocity, represented by a directed line 
AB (Fig. 14.3). Then it is natural to say that the velocity is doubled if 
its magnitude is doubled, its direction remaining the same. Thus if 



Fig. 14.3-—Multiplication of a vector by a number, and 

addition of vectors 


A'B' is drawn parallel to AB but of twice its length, then A'B' represents 
V multiplied by 2, or 2v. Similarly the product kv where k is any positive 
number A, will be taken to mean the vector with the same direction as v, 
but k times greater in magnitude. 'J'he vector — hv will naturally sig¬ 
nify the vector kv with its direction reversed, c.g. if A'B' = 2 .v, then 
B'A' = — 2 .V, by definition. The vector BA is the reverse of AB, and 
will be denoted by — v, or (- - i )»•. 

In this way we can define the product of an ordinary number, posi¬ 
tive or negative, and a vector. We can show that this product obeys the 
law 

h{kv)=^{hk)v . . . (14.1) 

For {hk)v means the original vector multiplied ///? times in magnitude, but 
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with its direction unaltered (or reversed, if hk is negative), kv means that 
the magnitude has been multiplied by and the direction unchanged 
(except for a reversal if k is negative). If then we multiply {kv) by h we 
have in all multiplied the magnitude by hky and we have left the direc¬ 
tion unaltered, or reversed it, according to the usual rule of signs in 
algebra (+ve X -{-ve = +ve, H-ve X —ve = —ve, etc.) So h{kv) = 
{hk)v. 

Furthermore, if HK is any line, and we draw AJJ, BV, A'U', B'V' 
perpendicularly onto HKy then since AB and A'B' make the same angle 
with HK^ the ratio of UV to AB must be the same as the ratio of U'V 
to A'B\ each being the cosine of this angle (Fig. 14.3). So if A'B' — 
kAB, then U'V = kUV. If we multiply a vector by k, we also multiply 
its component in any given direction by k. 

Finally, if we multiply a vector by o we obtain a “zero vector” O of 
no magnitude. The direction of such a vector is immaterial: all zero 
vectors are considered as equal. 

14.5 Addition of vectors 

Vectors can be added together. Suppose that a body is acted on by 
two forces v and w. v, let us say, is represented geometrically by a line 
ABy and w by a line BC. Then we know experimentally that the two 
forces acting together have the same combined effect as a single force, 
whose direction and magnitude are given by the line AC. It is natural 
to call this combined force v -f h\ so that vectorially (Fig. 14.3) 

AC — V w = AB + BC . (14-2) 

This is the “triangle of forces” theorem: from our point of view, it 
amounts to a definition of “addition” for vectors. It can be used to 
combine many other directed quantities besides forces: for example the 
electric fields produced by two different charges add together to give a 
resultant field according to this law. 

"^l his method of addition of vectors is very’ similar to ordinary 
addition in its properties. The laws of operation for ordinary addition 
and multiplication have been set out in Section 3.2 (which the reader is 
recommended to re-read). Let us write out those of vectors for com¬ 
parison. 

Firstly let us draw W L , BVy CIV perpendicularly onto the line HK 
(Fig. 14.3). Then C 1 ' is the component of r = AB along HK, VWis 
the component of h- -- />C, and UJV the component of r + w. Since 
4 - I'IF- r IF (taking signs into account) we see that the com¬ 
ponent of V w along HK is the sum of the components of v and h». 

Secondly, let us complete the parallelogram ABCE by drawing AE 
parallel to BC and EC parallel to AB. Since opposite sides of a parallelo¬ 
gram are equal in length, we see that vectorially AE = BC = h», 
EC -- AB — r, and the relation AE -f- EC = AC becomes 

H* -hr — AC = r -h w . . (14*3) 
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(the “commutative law of addition”). In the same way if we are 
given three vectors, AB = v, BC = h», and CD = x, and we complete 

D 



Fig. 14.4—The sum of three vectors 

the parallelopipedon, as shown in Fig. 14.4, we see that vectorially 

AD = AB + BD = V + (w + x) 

= AC + CD ^ (v + H>) + X 
= AE + ED = (h» + .r) + V, etc. . (i 4 - 4 ) 

i.e. vector additions cati be performed in any order. 

The addition of O to any vector leaves it unchanged, for 

O + V = AA + AB = AB = v . . (i 4 - 5 ) 

This is only natural, for if we interpret the vectors as forces then O is a 
zero force, and does not change any other force on combination with it. 

Now let ABC (Fig. 14.3) be the triangle of vectors representing the 
addition v + »v = (v 4 - h*). Draw a second triangle A'B'C similar 
and parallel to ABC, but of twice the size. Then A'B' represents zv, 
B'C represents zw, and A'C z{v + h’). But vectorially A'B' 4- B'C 
—A'C'y i.e. zv zw = z{v 4- h»). In the same way 

kv 4- = k{v 4- . . (14-6) 

for any number k (the so-called “distributive law”). 

Finally we know that two vectors acting along the same line (i.e. in 
the same direction, or in directly opposite directions) combine like 
ordinary positive or negative numbers. A force of 2 kilograms weight 
acting downwards combines with a force of 3 kilograms weight, also 
acting downwards, to give one of 5 kilograms. We can write this 

hv 4- kv ~ {h 1 k)v . . . (14-7) 
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In short, veaors obey exactly the same laws of addition and multipli¬ 
cation as ordinary algebraic symbols, except that so far we have given no 
method of multiplying 2 vectors together, but only an ordinary number 
and a vector. Apart from that restriction all the usual algebraic formulas 
hold good. 

14.6 Subtraction of vectors 

Let V and w be any two vectors; then it is natural to call the vector 
sum H' -f- ( — r) the “difference w — v between the vectors w and v*\ 
This is easily interpreted geometrically. Let v = AB, w = AE 
(Fig. 14.5). Then w ~ v = (~ v) -j- w = BA -h AE = BE. Since 





Fig. 14.5—Vector subtraction 


vectorially AB -h BE = AE we have 

« 

V {«■-,■)=„• . . (,4,8) 

Thus (k- - »■) is the vector which has to be added to v to make it 
equal to b’, which corresponds to the usual idea of subtraction as the 
reverse of addition. 

Also we have from this definition 

y ~ y B/i = O (14.9) 

Thus subtraction also obeys all the ordinarj- algebraic laws. 


example 

(i) To show that the three medians of a triangle meet in a point. 

A median of a triangle ABO is defined as the line joining a vertex 
to the mid-point of an opposite side; e.g. AP, where P is the mid-point 
of liC (Fig. 14.6). 

Now choose an origin O. and call the vectors OA, OB, OC, a, b and 
c respectively. Then by the subtraction law, AB = b — a, BC = 
c — h. But BP -- WC -= \{c - b), so that AP = AB BP = 
[b — a) 'I b) = ^b -r \c -- «. Now choose a point G on AP 
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A 


0 

Fig. Z4.6 —The centroid G of a triangle ABC is the intersection of 

the three medians 

and § of the distance along AP, so that AG = ^ AP = 4- Jc — §a. 

It follows that 

OG= OA -\AG=a+{hb + ic ~ la) 

= + ib -H 

Thus we have shown that G lies on the median AP through A. But the 
vector OG is expressed symmetrically in terms of a, b, and c, so it must 
equally well be true that G lies on the medians through B and through 
C. The point G is called the “centroid” of the triangle ABC. 

PROBLEM 

(i) Show that in any triangular pyramid (or “tetrahedron”) the four 
lines joining the vertices to the centroids of opposite faces all pass 
through a certain point G. Also show that the three lines joining the 
mid-points of opposite edges also pass through this point. 

14.7 The magnitude of a vector 

It is customary to denote the magnitude of a vector v, irrespective 
of its direction, by the symbol jr] (read as “mod v” and called the 
modulus or absolute magnitude of v). Thus |vj is an ordinary positive 
number. It obeys the law 

I***! = |A| kl . . . (14.10) 

Also, since no side of a triangle can exceed in length the sum of the other 
2 sides, 

AC\ < \AB + BC\, 

i.e. V 4 - H-| < |r + H-l . . . (14.11) 





378 


BIOMATHEMATICS 


PROBLEM 

(i) Prove that |v — h>| < |vl -f 

« 

14.8 Vectors in a plane 

All properties of vectors discussed so far are completely general, and 
true without any restriction. But from this point onwards we shall 
restrict ourselves to vectors lying in a plane. 

The reason for doing so is as follows. To express a physical quantity 
in numerical form it is necessary first to select a unit, and then to say 
what multiple of this unit is equal to the quantity in question. Thus if 
we are measuring the length of a foot rule, and our unit of measurement 
is an inch, we shall find that it is 12 inches long. If the unit is a centi¬ 
metre, it will be 30*48 cm. 

Now it would be very convenient if we could do the same for a vec¬ 
tor, expressing it in turn as a multiple of a unit. Naturally this cannot 
be done in terms of ordinary numbers alone, for a velocity of (say) 2 
metres per second eastwards is not a multiple of a velocity of i metre per 
second northwards. But it can be done by allowing a generalization of 
the idea of “number”. And this is not an unreasonable device. For the 
idea of number has been generalized several times in the course of 
history. It began with positive integers, used in counting. But men 
soon discovered that it w'as useful to supplement these with fractions, 
which were used in measuring. Later it was found convenient to invent 
negative numbers, and irrational numbers such as y/'z which cannot be 
expressed as an exact fraction. If the reader is prepared to accept yet a 
further generalization it will be possible to include the measurement of 
vectors in a plane. Unfortunately it is more difficult to do the same for 
general vectors in three-dimensional space; and that is why w'e have to 
confine our attention henceforw’ard to a single plane. 

Choose arbitrarily any vector 01 = / (say) in the plane, and call it 
the “unit vector”. Now consider any other vector v = OP, of length r. 
This in general differs from /, the unit, both in magnitude and direction: 
it w'ill be r times greater in magnitude, and it will differ in direction by 
some angle, say Q (Fig. 14.7; B will be measured as [positive if the 



Fig. X4.7—Complex numbers 
The measure {r, fl) of a vector v in terms of a unit / 
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direction of rotation from OI to OP is anticlockwise and negative if clock¬ 
wise, according to the usual convention). We shall then say that the 
measure of v in terms of the unit I is {r, 0 }. We shall look upon 
{r, 6 } as a single number of a new type, called a “complex number” z 
(“complex” because it is composed of two parts or ordinary real num¬ 
bers r and 6 ). Probably the simplest way of thinking of the complex 
number {r, is to consider it as the sign for an operation, namely 
the operation of magnifying r times and then rotating through an angle 6 . 
The performance of this operation on the vector / gives us the vector v. 
We look upon v as the unit vector I “multiplied by” the complex num¬ 
ber z — {r, d}y or in symbols 

V = zl= {r, e}/ . . . (14-12) 

In contrast to this an ordinary positive number h can be taken simply 
as a sign of magnification alone. For when we say that i foot has the 
measure 12 when expressed in units of i inch, we mean that a foot is an 
inch magnified 12 times. If v is a. vector, then the product hv signifies 
the vector v magnified h times, but unaltered in direction. In the same 
way a multiplication of a vector v by a complex number {r, 0 } means, 
by definition, a magnification r times and a rotation through the angle 6 . 
In particular a multiplication by the complex number {h, 0} is simply 
an A-times magnification, and has exactly the same effect as a multipli¬ 
cation by the positive number h. Thus the complex number {h, 0} is 
equivalent to the real number h in its effect on vectors—which is how a 
complex number is defined. For all practical purposes we can consider 
h and {A, 0} as identical. 

Similarly to multiply v by (—A) means to magnify it A times and 
reverse its direction. Multiplication of v by the complex number 
{A, tt} magnifies it A times, and rotates it through tt radians, = 180 , 
i.e. this also reverses its direction. Thus the complex number {A, tt) 
is exactly equivalent to the real negative number (—A). In short we can 
consider the ordinary real numbers we have so far dealt with as parti¬ 
cular cases of complex numbers, those in which ^ = o or tt. 


14.9 Multiplication of complex numbers 


Complex numbers will hardly be worthy of the name of “numbers” 
unless we can add, subtract, multiply and divide them. Mow we do so 
is of course a matter of definition, not of fact. We are free in principle 


to define “addition”, “subtraction”. 


“multiplication” and “division” for 


this new type of number in any way that takes our fancy. But since 


complex nunibers have been shown to include real numbers as a parti¬ 
cular case, it is clearly desirable that whatever definitions we choose for 


the addition, etc., of complex numbers agree with the familiar ones for 


the special case when the numbers arc real. It is also desirable that the 
ordinary properties of the signs —, X and : should be preserved as far 
as possible, and that the definitions should sound plausible and reasonable. 
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The operation which has the simplest definition is that of multipli¬ 
cation. Let us first go back to the idea of multiplication for real num¬ 
bers. Suppose that a magnitude is JCi times a magnitude and M2 
is a: a times M3. Then is ATiara times M3. For example, i yard = 
3 feet; i foot = 12 inches; so i yard = 3 X 12 inches. 

This suggests a rule for multiplying complex numbers. Suppose 
that a vector Vi is afj = times a second vector Vai i*e. Vi is 

obtained from Vg by magnifying it rj times and rotating it through the 
angle Suppose further that Vz is Zz times Vg i.e. is obtained by a 
magnification Tz and rotation $2- Then it is natural to say that Vi is 
(ziZz) times vg. That is, if Vj = ZiVz and Vz ~ ZzV^ then 

Vi = Zi ~ • * (t 4 ‘t 3 ) 

But (Fig. 14.8) Vi is obtained from Vg by 
magnifying it TiTz times and rotating it 
through 61 + 0 a, so that 

•^ 1 ^ 2 = {''1* ^2}= ^1 + ^2} 

(14.14) 

This is the definition of multiplication. If 
we interpret Zi and Zz as operations of 
magnification and rotation, then the product 
ZiZz will be interpreted as the effect of 
successive application of the two operations. 
First (to change V3 into >*2) we magnify Tz 
times and rotate through an angle dz\ 
secondly (to change Vz ir^t® Vj) we magnify 
rj times, and rotate through The total effect is a magnification r^Tz 

and a rotational + 02- 

From (14.14) we see also that 

= {^2» ^2 }{''!> ^1} = ^2 + ^1} 

= {riTzy + ^ 2 } 

= ZxZz • • ( 14 *^ 5 ) 

so that multiplication of complex numbers can be done in either order 
just as for real numbers. Also if = (rg, 0 s} is a third complex 

number 

— {^i» ^2 + ^3} = {''i'' 2 ^ 3 . + ^2 ^3} 

(iri;C2)"3 = “b ^2}(^3' ^ 3 } = 3» ^1 + ^2 

{zzZ:i)zi = {r^rg, 0^ + 03 }{r„ 0 i} == {rir^r^, 0i + 02 + 03} 

i.e. z.iz.Zz) = (-i-2)"3 = (^2^3)^!. so that multiplications of any num¬ 
ber of complex numbers can be done in any order, just as for real num¬ 
bers (“commutative” and “associative” laws, in technical language). 



Fig. 14.8—-Multiplication 
of complex numbers 
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There is one special case of interest. Multiplication by {o, 6 ) 
means by definition “magnification o times, and rotation through 
angle 6 ". But magnification by o results in everything being reduced to 
zero, independently of the angle 6 . Thus it is convenient to consider 
{o, as equal to the real number o whatever the value of 6. We 
then have 

o;j = o . (14-16) 

whatever z may be, exactly as for real numbers. 

14.10 Powers of complex numbers 

Since multiplication of complex numbers obeys laws similar to 
ordinary multiplication it is natural to denote the product zz of z into 
itself by the symbol and call it the “square of z”. Similarly zzz is 
denoted by z^ and so on. Such powers obey the usual laws, e.g. 
z-z^ = (5:5:) {zzz) — zzzzz = z'^, and in general = s"‘ 

In terms of the {r, 0} notation we have by the rule (14.14) for 
multiplication 

-2 = ^r, 6} {r, 0) = {rr, 0 + 0 } = {r-, 2 O) 

= {r, 0) {r, 6) {r, 6} = {rrr, 0 + 0 + 0} = 3 0} 

and in general z** = {r, 6 )’* = {r'‘, h0 }. 

X 4 .XX Multiplication of real numbers 

We have now to show that this definition of multiplication for com¬ 
plex numbers agrees with the usual definition when the numbers are real. 

Consider first the case of two positive numbers // and k. As we have 
already seen, h is effectively equal to the complex number {//, o), and 
similarly k ~ {/«, o}. 'I'he product of the two real numbers h and h 
is hk, while the product of the two complex numbers {//, 0} and 
{k, 0} is {hk, o -b o} = {hk, o). If the two definitions are to agree, 
these two products must be equal, i.e. hk -- {hk, o}. But that is true 
since hk is positive. 

Now consider two negative numbers —h and ~ k. According to the 
ordinary rule for multiplication their product is positive and equal to 
hk. Now when we interpret —h as a complex number it becomes 
{A, rr}, i.e. a magnification by h, and a rotation through n radians 
= 180®. Similarly —k — {k, tt}. 'I'he complex product of {h, n) and 
{k, rr} is by definition {hk, rr tt) = {hk, 2rr}, that is, a magnification 
by hk and a rotation through 180° -b 180® — 360°. But a rotation 
through 360® is equivalent in its effect to no rotation at all, i.e. 

{hk, ztt) = {hk, o). 

This is equivalent to the real number hk, so the two definitions of multi¬ 
plication again give the same answer. Incidentally this provides an 
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interpretation of the usual rule that the product of two negative num¬ 
bers is positive: it means that two rotations through 180° combine to 
bring the vector back to its original direction. 

We leave the reader to verify in a similar way that the agreement still 
holds in the remaining case of the multiplication of a positive and a 
negative number. 

14.12 The square root of minus one 

By our definition, multiplication by a number {i, B} means 
rotation through the angle B without change in magnitude. Thus 
{i, 0} or {i, 27r} signifies no change, and is equivalent to multiplication 
by I, while {i, tt} merely reverses directions, and is equivalent to —i. 

Now consider {i, \tt}. This means rotation through a right angle 
in an anticlockwise direction: we call it Then 

1*2 = a = {i, ^77} {i, ^77} = (i, 77} = —I. 

i.e. a rotation thrpugh 90° repeated is equivalent to one through 180°, or 
a simple reversal, i is therefore the famous “square root of minus one”, 
or rather i is one such square root: another is —i = {i, for 
(—i) = {i, 377}, and a rotation through 377 radians is equivalent to 
one through tt, i.e. {i, 377} = {i, 77} = — i. 

When mathematicians first began considering the square root of minus 
one, they could not see any simple interpretation of it, for they knew 
that the square of any ordinary number was positive. So they called the 
square root of a negative number, such as i, an “imaginary number , 
and ordinaiy numbers such as 2, 77, or —il were called “real” numbers. 
Nowadays we know that i has a simple interpretation: it is a rotation 
through 90®. The equatioit — —i means simply that t^vo successive 
rotations through 90® are equivalent to a reversal. So i is no more 
imaginary than any other number. But the old names still remain, 
and we talk of 2, tt, and —5 as “real” numbers, and *, ziri and — 
as “imaginary” numbers. These terms are now purely technical, and 
no longer imply that a “real” number really exists, or that an “imagin¬ 
ary” one doesn’t. 

Consider further the number oj = {i, ^tt}. This means a rotation 
through 377 radians = 120®. Thus if we perform the rotation three times 
we come back to the original position, cu® = {i, 277} = i. Thus w 
is a complex cube root of i. Another cube root is co* = {i, for 
= (i, 477} — i; and of course i is also a cube root of i. So 
I has 3 complex cube roots, i, to, and oj®. It has 4 fourth roots, i, 

—I, and —I, and we can show that in general it has n nth roots. 


PROBLEM 

(1) What are the n wth roots of i ? 
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Z 4.13 Division of complex numbers 

In ordinary arithmetic the quotient xjy means that number which 
when multiplied by y gives x. We shall therefore take the quotient 
of two complex numbers ZijZi to mean that number which gives a-, 
on multiplication by z^. Now if = {rj, and z^ = {r^, ^2}* 
then to get from 5^2 to z^ we have to multiply the magnitude of ^ 2 by 
Tijr^y and rotate it through an angle 6^ — 62’, i.e. 

^1/^2 = {''1. O2} = {rjr2, — 62} (14-17) 

This is the definition of division of complex numbers. It has the 
property we require, for 

^Z ijz^z 2 = ^1 ^2}{^2> ^ 2 }” {^li ^ 1 } ~ ^ 1 

by the rule (14.14) for multiplication. We also see that the definition 
(14.17) fails to give a definite quotient only if rg = o, i.e. Z2 = o, so 
we can divide any complex number by any other complex number 
except o. 

In particular the number i/a^ = {i,o}/{r, 0 } = {i/r, —d} is called 
the “reciprocal” of z. 


PROBLEMS 

(1) Show that i/i = — I. 

(2) Show that = —I. 

(3) Show that i/tu = a>*. 

(4) Show that zjzi = Z2 • {ijz^). 

(5) Show that (ZiZ2)'^ = • Z2^. 

(6) Show that (ilzi) {ijz^ = 

(7) What are the cube roots of — i ? 

(8) What arc the square roots of co? 

(9) Show that every complex number z = {r, 0 } has exactly two 
square roots, except the number o which has only one. 

14.14 Geometrical representation of a complex number 

A complex number is the measure of a vector v in terms of a unit 
vector /. If therefore we choose arbitrarily any vector 07 in the plane 
as unit vector /, we can represent the complex number z =< {r, 0} 
by the vector v = zl. v has length r and makes an angle 0 with /. 
The number i corresponds to the vector / itself (= i/), and the number 
o to the vector O. Various complex numbers are drawn in this way 
in Fig. 14.9. 

If the number Z2 = {r*, 0 ^} is represented by a vector Vj, then 
the product ZiZ2 = {r„ ^2} >8 represented by the vector z^Vzy 

i.e. by Vg magnified rg times and rotated through 

Furthermore the absolute value or length or magnitude of the 
vector V = zl simply \v\ = r. So we call r the “absolute value” 
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“magnitude”, or “modulus” \z\ of the complex number z = {r, $} 
Since (rj, ^2} = {''i''2» + ^2} we have 

1^1^21 = ^1^2 ~ l^li 1^2! * • • (^ 4 **^) 

For a positive number \h\ — \ {hy o}| =for a negative number 
|—/ij = \{hy 7r}| = hy so that for real numbers this agrees with our 
former definition of the modulus as magnitude irrespective of sign. 

The angle 6 is called the “amplitude” of the complex number 
{r, d). Unfortunately this is not uniquely determined: the number 
{r, 0 } can equally well be written (r, 6 + Ztt}, or {r, 0 + 47r}, since a 
complete revolution has no effect. So if 0 is one possible value of the 
amplitude, 0 + 2?/7r is another value, where n stands for any integer. 

14.15 Complex addition and subtraction 

So far we have not given any definition of “addition” for complex 
numbers. But we have a definition of addition of vectors, and we have 
a method of representing complex numbers by vectors. So it is natural 
to combine these facts to supply the required definition. Let z^ 
be any two complex numbers, represented by the vectors Vj and Vg. 
Let V3 = -h Vz be the sum of Fj and Vg, and let Z3 be the corres¬ 
ponding complex number. Then it is natural to call z^ the “sum of 
Zi and Zz \ and to write 3:3 — jSi + ^2- Such an addition c^ be per¬ 
formed graphically by using the triangle law of vector addition. Thus 
in Fig. 14.9 the vector OB represents co = (i, fir}, and the vector 
BC represents i = {i, J^r}. The sum Oi? + SC = OC therefore 
(by definition) represents the number a» + i. By measurement or 
trigonometric calculation we find the length OC to be VS 
angle /_IOC to be iV so that 

CO f = {\^3, rs tt}. 
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This rule for addition must obey all the ordinary laws of addition, 
since vectors do so. For instance, since Vj + Vg = >’2 + it follows 

that Zi -h ^2 = Za -h Zi, and since + v^l < |v,| + 1^2] (formula 
14.11) it follows that 

l“I +^2! I^ll + I- 3 ' 2 | * . . (14.19) 

It also agrees with the general rule for the addition of real numbers, 
which correspond to vectors lying in the same straight line as /. 

We can define the difference ar, — Zo of the two complex numbers 
Zi and Z2 as being the number represented by the vector r, — Vj- 
Since subtraction of vectors obeys all the usual algebraic rules, it follows 
that the same will be true for complex numbers. For instance, since 
(^i —>’2) + »’2 = we see that (.s, — So) -h Z2 = z,. 

PROBLEMS 

Prove the following results: 

(1) CU + = —I. 

(2) oj — = \/3 , 1. 

(3) For any number z, z — z = o. 

14.16 The distributive law 

We have now defined all four arithmetical operations for complex 
numbers. We have shown that as far as multiplication and division 
are concerned the rules of manipulation are exactly the same as for 
ordinary real numbers. The same applies to addition and subtraction. 
One further property remains to be proved: the so-called “distributive 
law” z(zi 4 - Z2) = zz, -{- ZZ2 which involves both multiplication and 
addition. 

But this is easy to establish. Let ir, be represented by a vector AB 
and 22 hy a vector BC: then 2r, + ^2 = AC (Fig. 14.10). Now if 



Fig. 14.10—The distributive law for complex numbers 
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Z = {r, e) then zz^ is represented by a vector obtained from 

AB by magnification by r and rotation through d. The vector B’C 
representing zz^ is similarly obtained from BC. Thus A'C = A'B' A- 
B'C = zz^ + zz^. But all we have done is to magnify the whole 
triangle ABC r times, and rotate it through &\ so A'C' = {r, 6 } AC^ i.e. 

zzj^ + zzz = z(zx Zi) . . . (14.20) 

We can now prove that any ordinary algebraic formula is true for 
complex numbers. For example, 

(^1 + = (^1 + ^2X^1 + === (^1 + + (^1 + ^2)^* 

(by 14.20) 

= z,{zi +Z2) +^2(5^1 +^2) 

= 2:,3:1 + Z1Z2 + S 2 ^l + ^ 2^2 
= + Z1Z2 + ar,aro + z^^ 

= zY + 22i2r2 + 32®* 

In general we can do with complex numbers anything that can be done 
with real numbers, and also many other things that were previously 
impossible. Thus we are no longer prevented from taking the square 
root of a number because that number is negative. Other ways in 
which we have greater freedom of action will appear presently. 

As a rule therefore an argument will gain in generality if we use 
complex numbers instead of real ones, for real numbers can be con¬ 
sidered as a special case. Before we illustrate this further it may be 
as well to point out the few exceptions. If 3 is a complex number we 
can no longer assert that 3® is positive or zero, as is the case for real 
numbers. Neither can we compare complex numbers. If represents 
a force of i unit pointing eastwards, and Vg a unit force northwards, 
then r, and are certainly unequal, but it would make no sense to 
say that one was greater than or less than the other: relations such as 
“greater” or “less” apply only to real numbers. It is true that we can 
say of two vectors or complex numbers that the first is greater than 
(or equal to, or less than) the other in magnitude. But such a relation 
can be written | 3 || > |32| (or | 3 ,| = or \^i\ < ^2!) ^xid is rnerely 
a comparison between the two positive numbers | 3 , and | 32 | with no 
new principle involved. But apart from these rules, everything which 
is true of real numbers is true also for complex ones. 

14.17 Real and imaginary parts 

Let the complex number 3 = {r, d] he represented in a plane 
by the vector OP = v. We shall as usual denote the unit vector I 
by 01. Then the point P will have polar co-ordinates (r, 6 } relative 

to 01. 

Now draw two co-ordinate axes in the plane, the x-axis OX along 
07 , and the _v-axis OY at right angles to OX (in an anti-clockwise 
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Fig. 14.11—Real and imaginary parts of a complex number 

z = {r, e} s= X + yi 


direction). Then if P has cartesian co-ordinates (a.*i >) (Fig. 14. ii), 
and we draw PU perpendicularly on to the A’-axis, the length OU will 
be X, and the length UP will be^'. Now vectorially OP ~ OU 4- UP. 
Here OP represents the number z, OU = x . 07 , and therefore repre¬ 
sents the real number a, and UP is y times a unit vector pointing in 
the direction of the jy-axis, and so represents yi, where i = {i, 4”^}. 
Thus OP = Of/ -j- UP is equivalent to 


z = X'{-yi .... (14.21) 

where x and y are ordinary real numbers. 

This notation “x -f yV' gives us an alternative way of writing a 
complex number. It is related to the {r, 0 } notation in the way that 
cartesian co-ordinates are related to polars, that is 


Thus 


X = r cos $ y = r sin 6 

\z\ = r = ^/{x- + y"^) . , . (14.22) 

z = (r, 6 } = X iy = r cos 6 + ir sin 6 

= r (cos 6 i sin 6 ) . (14.23) 


For example, = {i, = cos §77- -}- 1 sin = — 4 + 

X is called the “real part” of the complex number xr(x = Rlxr), and y 
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the “imaginary part” {y = Im z). Fig. 14.12 shows diagrammatically 
several complex numbers written in the form x + yi. 

This form of writing a complex number is the most convenient 
one for most calculations. 


EXAMPLES 

(1) Add (3 + 4i) to (2 + 50 - 

(3 + 4 *) + (2 + 5 *) = (3 -h 2) + (4/' + sO = 5 + 9 * 

(2) Subtract (2 + 3O from (6 -f 7 *)- 

(6 4 - 7 *) - ^2 -t- 3O = (6 - 2) + (7* - 3O = 4 + 4 * 

(3) Multiply -h 0 hy f- 

i(i 4 - 0 = f + _ j 

(4) Multiply (2 4 - 0 hy (i + 21). 

(2 4 - 0 (* + 2O = 2(1 4 - 21) 4 - i(i + 21) 

= 2 4 - 4* + f + = 5 * 

(5) Divide 3 4 - 41 by 2 4 - 3*. 

Here the trick is to multiply numerator and denominator of the 
fraction (3 4 - 40 /f 2 4 - 3*) by (2 — 30 > to make the denominator real: 

3 +_ 4 .* ^ ( 3 ^_ 4*')(2 - 3 *) ^ 3(2 - 3 *') + 4*‘(2 - 3 *) 

2 4 - 3 * (2 + 3*)(2 — 3 *) (2)^ — (3*)^ 

_ 6 — 9/ 4- 8* — 12*- 

T+'g^ 

_18 — * 18 I . 

13 ~ *3 *3 

PROBLEMS 

(1) Add (3 - 21) 4 - (6 4 - 7*) + (* - 0 - 

(2) Multiply (i +0 by (* — i). 

(3) Divide (i 4 - 0 hy (i — 0 - 

(4) Divide 2 4 - 3 * hy t- 

14.18 Complex conjugates 

The complex numbers .v + iy and .v — iy are said to be “conjugate”: 
they differ only in the sign of the imaginarj' part. 

— y. then X — iy is usually written z or z* (read as 


bar” or “2 star” respectively). Thus, for example, 2 4- 3* = 2 — 31. 
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These conjugates have several important properties. Let = 
^1 + fyi and Zz — X2 + iyz be any two complex numbers. Then 
Zi Z2 = {xi + i>i) + {Xi + iy^) = (JCi + :V2) + i(yi +>'2). Now by 

definition Zi =Xi — iyi, Z2 = X2 — iy2, so that 

+ ^2 = (^1 + ■*’2) — t{yi + J2) = ari + «2 • . (14-24) 

i.e. the conjugate of a sum is the sum of the conjugates. 

The following similar results are left as exercises for the reader. 


PROBLEMS 


Prove that: 


(0 

( 2 ) 

( 3 ) 

( 4 ) 

( 5 ) 

( 6 ) 

( 7 ) 

( 8 ) 

( 9 ) 

(10) 

(11) 

(12) 

(13) 

(14) 


The conjugate of {r, ^}is {r, — 0 }. 

If z is real, then z = z, and conversely 


t = —I. 


Ol 




^ ^ ^ ^ 
1 iC O ^ 1 


Z\Z 2 ~ ^2* 

z z = 2x (where z = x iy) 

z — z = zyi. 
rw*r — I ^1 2 


Z^ = 


I + 22: + 3 -s* = I 4 - 25 r + 32=1. 


_g.n _ 


1/2' = ijz. 


Interpret z geometrically. 


There is one other important property. Suppose we have an equa¬ 
tion such as 2 +22 +2* 4 2® = o with real coefficients. This equation 
can have solutions or "roots” which are real; it can also have roots 
which are complex and not real: e.g. 2 4 22 4 2^ 4 2® = o has the 

roots —2, I, and —i. We can prove that if 2 is a root then 2 is also a 

root: and if 2 is not real 2 will be different from 2. We do that as 
follows. Let the given equation be A 4 Bz 4 C2® 4 Dz^ = o, 2 being 
any root. (A cubic equation is chosen for the sake of illustration, but 
the argument remains valid for any polynomial.) It follows, on taking 

the complex conjugate, that ^ Bz C2* 4 = 0 = 0 also. But 

by the rules we have given above 
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A~^Bz + Cz^ + = A + ^ A- 

= A +Bz -\-Cz^ 

= ^-^Bz -hCz^ -{-Dz^. 

Since Ay 5 , C, D are real, A — A, B = By C = C and D — Dy 
and therefore A + Bz + Cz^ + Dz^ = o, i.e. z is also a root. 

14.19 Complex logarithms 

We now consider the problem of extending the ideas of powers, 
exponentials, trigonometric functions, and so on, to complex numbers. 

Can we find a logarithm of a complex number? We speak of 
“finding” such a logarithm: strictly speaking of course it is a question 
of definition: we are free to define a logarithm in any way we choose. 
But such a definition will seem arbitrary unless the new kind of logarithm 
so defined has the same kind of properties as the old one, and unless 
it agrees in value with the old when z is real. Now the most important 
property of the logarithm function is that it reduces multiplication to 
addition, i.e. that log (ab) = log a + log b. Is there a similar function 
log of a complex number z such that 

if C3 = x;jS-2 then log 5^3 = log Zi + log Z2^ . . (14*25) 

Let Zi — {rj, 61 }, Zz = {r-, ^2}* ^3 ~ {^3» ^3}* then since ZiZ^ = 
z^ we know that = ^i^2» I*®* 

log Ta = log ri H- log ra (with ordinary logs) 

and ^3 = + ^2 

'rhus if A and B are any numbers whatever (real or complex), 

(.4 log rg -r Be^) = A (log rj -f log r^) -r S (0, + ^2) 

== (A log Tj + B 0 i) + {A log ra + BBi) 

Thus any function of the form A log r B 0 (with log r according 
to the old definition), will satisfy the fundamental property of 
logarithms. However, we shall also require our new definition to be 
equivalent to the old definition when z is an ordinary positive number, 
so that log {^,0} according to the new definition must equal log r 
according to the old. That is, A log r -f B.o must equal log r; clearly 
this will be true if we take .4 — 1. We have thus extended the idea of 
logarithms to complex numbers, log {r, 0 } = log r B 0 . 

However, there is still one further condition which it is desirable 
to satisfy. Consider natural logarithms. If .v is small and real we know 
that In (i -}- .v) = X — kv" + — • • • === -v to the first order of small 

quantities. Can we arrange for this to be true also for the complex 
logarithm, i.e. if \z\ is small can we arrange our definition so that 
In (i j c) ~ c, neglecting small quantities of the second order? 
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Let 01 be the unit vector (Fig. 14.13) and let IP represent a small 
complex number z = x iy. Draw PU perpendicularly onto OI 
produced: then lU = x and the length UP = y. 



Fig. 14.13—The value of In (x -f z) when z is small 


Now the vector OI represents the number i, and IP represents z, 
and vectorially OP = OI -h /P = i z. Suppose that i z = 
{P, 0 }; then R means the length OP, and 0 the angle /_IOP. 

Since z = IP is small we see that 0 is a small angle. So OU = 
OP cos 0 = OP = P to the first order of small quantities (see Section 
6.12). Therefore 

R=^OU = OI + lU ^ I + jc. 

Also y = PU = R sin 0 

P0 to the first order, since 0 is small 

cs; (I + -v) 0 

0, neglecting the product a:0 which is of the second order 
of smallness. 

Thus I + z = {R, 0 } = {i + .Y, j}. 

According to our new definition of logarithms (suitable for complex 
numbers) we must take In {P, 0 } to mean In P + P 0 ; that is 

In (i 4- 2) = In (i + -v) 4- By 

=s A- 4 - By since x is small. 

But we wish to arrange our new definition so that In (i 4- 5:) is approxi¬ 
mately equal \.o z = x + iy when z is small, and this will be so if we 
take B to be 1. 

Thus it seems appropriate and reasonable to define the natural 
logarithm of the complex numbers' = {r, 6 ) to be In r 4- i6, where 
In r stands for the usual real natural logarithm of the positive number r. 
However, it will be convenient to pi^kc a distinction in the next few 
sections between the definition for re^J numbers, as given in Chapter 6, 
and the new definition for complex numbers. For the time being, 
therefore, the complex natural logarithm will be denoted by the symbol 
Ln s, with a capital letter, and is accordingly defined as 

Ln s = Ln {r, 0 ) = In r 4-. . (14.26) 

(This convention concerning the use of the capital is not generally 
adopted, but seems helpful here.) 

I'he following properties accordingly follow from this definition: 
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p) It agrees with the usual definition for real logarithms. For any 
positive number r can be considered also as a complex number 
{r, o); and from (14.26) 

Ln {r, o } = In r 

(ii) It obeys the addition law, Ln = Ln -f Ln z^. For if 

^1 = {^1* ^1}, and z^ — {Tz, ^2}» then 

{f'if'2 y “b ^2}' So 

Ln Zi — In rj + 

Ln Z2 = In rg + id^y and 
Ln {Z1Z2) = In (r^r^) + i (^i + e^) 

= In ri + In -h tOi + iS^ = Ln z^ -\- Ln z^. 

(iii) If z is small, Ln (i + -a) =5^ z, as shown above. 

There is, however, one new feature about the complex logarithm 
which is not shared by the real logarithm: it is many-valued. For the 
angle 0 associated with the complex number z is not uniquely deter¬ 
mined; we could equally well take it to be 0 + ztt, or -0 -{-477, or 
^ — 277 . For 277 radians = i complete turn, and we can add any 
number of complete turns to an angle without altering the final result. 
It follows from {14.26) that In r 4 - -f 277), In r i(B + 477) and 
iii general In r -j- T 2H77) are all possible values of Ln z. Thus 
Ln 2: is a function with an infinite number of different values, any 
two of which differ by a multiple of ztH. It may be compared in that 
respect with the real function tan”*flr, which also has an infinite number 
of values. For if ^ = tan“'.v, i.e. .v = tan 0 , then x is also equal to 
tan (ff 4 77), tan (f? 4 277), etc., so that ^ + 77, 0 4 277, ... and in 
general 0 mr are all possible values of tan“^v. 

It follows that if we wish to be strictly accurate we should modify 
the statement of the properties (i), (ii) and (iii) above as follows: 

(i) ' In r is one possible value of Ln r = Ln (r, o}. 

(ii) ' Whatever z^ and 2, may be (other than o) Ln z^ + Ln z^ 
is one value of Ln (2122). 

(iii) ' If 2 is small, there is a value of Ln (i -f 2) approximately 
equal to 2. 

In Chapter 6 we defined and studied the logarithms of positive 
nunibers, but did not find it possible to give any definition of a logarithm 
of a negative number. The new complex definition provides every 
number other than zero with a logarithm. For example, — i — {1,77} 
and therefore one logarithm of — i is In i -i- tt/ = tH. Other possible 
values are 377/, 577/, and in general {211 + 1)771. 

?\iO{e —\\’hen logarithms are being used for numerical calculations 
involving negative numbers it is best to use the special device explained 
in Section 6.y. 
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PROBLEMS 

(1) Show that one value of Ln i is 

(2) Show that one value of Ln is 377/. 

(3) Find Ln (i + i). 

(4) Find Ln (—^). 

(5) Show that Ln z = Ln z. 

14.20 Complex exponentials 

For real numbers the function X = exp x was defined as the natural 
antilogarithm; i.e. the relations X = exp x and x = In AT mean the 
same. The same definition can be applied to complex numbers: if 
2 = Ln Z, then we shall write Z = Exp 2, using for the moment a 
capital letter to distinguish this from the real exponential function. To 
find what this definition means, let us write 2 in the form x + iy, and 
Z in the form {/?, 0 ). Then the relation 2 = Ln Z becomes 

X iy = \n R + i 0 

i.e. a: = In /?,>'=©, or R = exp .v, Q = y, and Exp z = Z = 
{R, 0 ) = {exp X, y}. By the use of equation (14.23) this can altern¬ 
atively be written 

Exp 2 = exp .V (cos jv + ; sin y) . . (14.27) 

Given any complex number z =s x + iy this formula gives us a 
convenient way of calculating Z = Exp 2. 

PROBLEMS 

Find the values of (i) Exp i, (2) Exp ni, (3) Exp (i + ni). 


The alternative form of this relation, Exp {x + iy) = {exp x, y) 
provides a simple geometric construction for the exponential or natural 
logarithm of a complex number. 

Let 0/ be the unit vector. Draw through / an equiangular spiral 
with centre O and angle ^ -- {tt 45^ (Fig. 14.14). As we showed 
in Chapter 6, this spiral is the one defining the ordinary exponential 
function, in the sense that if we draw a line OP' making an angle x 
(radians) with Of and meeting the spiral at P', then the length OP' is 
exp X. Now rotate OP' about O, keeping its length constant until 
It becomes OP where Z.IOP = y. Then the vector OP represents the 
complex number {exp x, y}, i.e. the required exponential. 

Conversely, suppose that a complex number Z is given, and is 
represented by the vector OP in terms of the unit vector 07 . To find 
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Fig. Z4.14 —The exponential of a complex number 


the natural logarithm of Zy draw a circle with centre O through P to 
meet the spiral at P\ Then 

Ln ^ — /ilOP + i/_IOP' . . (14.28) 

provided that the angles are measured in radians. 

The relation Exp z = exp x (cos y + i sin y) — {exp Xy y) 
shows that the complex exponential is a single-valued function: given 
the value of Zy that of Exp z is completely determined. Furthermore 
if z is real (so that s' = a:, = o) we have Exp z — {exp x, 0} =s 

exp X = exp z. Thus the complex and real exponential functions 
agree exactly in value whenever z is real, which is of course the only 
case in which the real function is defined. So there is really no point 
in distinguishing between the two functions, which are always equal 
when both are applicable, and we shall henceforth use the same symbol 
exp z to mean either function, as the context allows. Furthermore 
the definition of Z — exp z as meaning the same as the relation z = 
Ln Z shows that 

Z = exp z = exp Ln Z . . . (14.29) 

for all Z except Z = o, and irrespective of what particular value we 
choose for Ln Z. 

Query —Is it equally true that ar = Ln exp z for all values of sr? 
We also find from the definition (14.27) that 

exp [zi -h ^2] = exp [(.V, + O'l) + {x^ + *>2)] 

= exp [(a*i + x^) + ;(>’i + J2)] 

-- {exp{xi + .V2). (>-, +>>2)} 

= {(exp .V, exp .V2), (jvj (since Xj, x^ are real) 

= {exp ATi, j,} {exp .¥2,3-2} (by 1414) 

— exp Zi . exp Zz . . . . . (14.30) 

whatever the values of rr| and z^ may be. 
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In the same way it can be shown that 

exp [Zj — = exp z^/exp z^ 

Thus the complex exponential function obeys exactly the same laws 
as the real one. 

Z4.21 Complex roots 

If z is the square of the complex number Z, then it is natural to 
call Z the “square root” of z, exactly as for real numbers, and to write 
Z ~ -y/z. 

The value of Z is most easily calculated by using the polar form, 
writing z= {r, 6] and Z= {/?, ©}. Then the relation Z == \/z 
means by definition the same as = Z“ = Z x Z, that is 

{r, 0 } = {R, 0 } {R, ©) 

= {RRy 0 0 } 

{R\ 20 }. 

This implies that r = R^, i.e. R = \/r, the ordinary positive square 
root of the positive number r. It does not, however, necessarily mean 
(as one might at first be tempted to suppose) that 6=2©, but merely 
that 0 and 2 0 must differ by an integral number of complete turns, 
21m, so that 

2© = 9 zrrn, or ©= ^9 nn 

Thus the possible values of 0 are ^ 9 , ^9 tt, \9 zn, . . . etc., 
and ^9 — tt, 1^9 — z-n, . . . etc., and the possible values of Z = \/z = 
{R, ©} are {\/r, ^ 0 }, {\/r, ^9 n), {y/r, i 9 + zn), . . . etc., and the 

expressions {\/^, ^9 — rr}, { \/r, ^9 — 2tt}, .... etc. But {\^r, 

(v/r, ^6 zn}, {V''. + ^-rr}, . . . etc., are merely different ways of 

writing the same complex number; and the same holds for 

{■s/r, ^0 + 7r), {y/'r, + 37r}, . . . etc. 

So in fact a complex number has just two square roots: 

V" = {V''. or {\/r, {9 + tt}. 

The second root {\/^> + tt} is simply the first rotated through 

TT, i.e. reversed in direction, or multiplied by —i. Thus the two roots 
can alternatively be written as y/z = i (Vr, ^ 9 }, and are calculated 
by the simple rule “take the square root of the modulus r, and halve 
the polar angle 0 '\ But unfortunately it is no longer possible to dis¬ 
tinguish one of them as positive and the other as negative, as in the 
real case, since in general they will be complex numbers. 

These square roots have similar properties to those of ordinary real 
square roots. For example, we can show that the usual process of 
halving the logarithm can be used for calculating a square root; that 
is to say, the number Z whose natural logarithm is ^ Ln c will be a 
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square root of Z. For this number Z is by definition exp Ln S’), 
and therefore 

= exp (i Ln 2) x exp Ln 2) 

= exp Ln 2 ^ Ln -s) (by 14.30) 

= exp Ln 2 
= z (by 14.29). 

PROBLEMS 

(1) Show that \/(«^) = ± 

(2) Show that -\/(~i~2) = ± 

(3) Show that \/(-i/-2)= ± V^ilV^z- 

(4) What number has only one square root? 

(5) What are the values of \''(2i)} 

(6) How many cube roots does a complex number 2 = (r, 6 ) 
have, and what are they? 

(7) Show that \/(cos ^ + f sin d) = ^ (cos + / sin 4 ^). 

14.22 Complex powers 

For any real number x we know that .v® = exp (2 In .v), .v® — exp 
(3 In .v), .v^ exp (4 In .v), and in general .v" — exp (« In x), whenever n 
is a positive integer. For these relations are by definition equivalent 
to In (.V®) — 2 In .V, In (.v®) = 3 In .v, ... In (.v") = « In .v, that is, to 
the rule that the «th power of x is that number whose logarithm is n 
times the logarithm of x (see Section 6.3). 

It is not difficult to show that a similar relation holds for complex 
numbers, using the complex logarithm and exponential functions: 

u" = exp {n Ln ;?)... (14.32) 

whenever ft is a positive integer. For starting with equation (14.29), 
we apply equation (14.30) repeatedly. 

2“ = 2 X 2 — exp (Ln xr) exp (Ln :r) 

exp (Ln XT ^ Ln 2) 

=. exp (2 Ln 2) 

— 2 X 2~~ - exp (Ln 2) exp (2 Ln xr) 

= exp (Ln XT -f 2 Ln xr) 

- exp (3 Ln x;) 

2* ~ 2 X x;®= exp (Ln xx) exp (3 Ln xx) 

= exp (Ln XX -f- 3 Ln xr) 

— exp (4 Ln .sr) 

and so on up to any desired power. 
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From the definition of the exponential function as the natural anti¬ 
logarithm it follows that equation (14.32) can equally well be stated 
in the form “If Z and z are such that (some value of) Ln Z is n times 
(some value of) Ln then Z = z'^". Thus powers of complex numbers 
can be found by taking logarithms in the usual way. 

Furthermore, since exp 0=1, we have by (14.31) 

exp (—n Ln z) = exp (o — n Ln z) 

= (exp o),'(exp ft Ln z) 

= I / z '^ 

So, just as with real numbers, it is natural and convenient to write 
ifz^ as ar”". All the usual rules of indices hold, such as that z"'z’' = 

SO on. Since we have also shown in the 
previous section that exp (^ Ln z) is a square root of z, it is natural 
to write \/s as z^, and also y/'z^ as z\ i/v^-s^ ^s z~^, and so on, exactly 
as for real numbers. Again all the usual laws will hold, with proofs 
exactly similar to those of Section 6.10; except that in the complex 
case there will be some caution necessary since \/z is now a two¬ 
valued function. Thus we may have z^ = z . z^ or z^ = — z . z*, 
according to the values chosen for the square root. 

This gives a definition of z" whenever n is an integer or a simple 
fraction such as ^ or — 3 . It is tempting to generalize this definition 
to complex powers, and to write exp (m Ln xr) as z'‘ for any complex 
values of z and u. This is indeed the standard definition of z". But this 
notation is rarely used in practice because in general it has an infinite 
number of different values corresponding to the different values of 
Ln z, and they are difficult to keep under control. 

It is, however, customary and convenient to use the symbol 
where is a positive number, for the quantity 

/?* = exp (c In/f) . . . (14.33) 

with the convention that we choose the ordinary real natural logarithm 
for In k. (This is only possible if k is positive.) With this convention 
k* becomes a single-valued function of z and /f, and all the usual laws 
of indices hold without any restriction. In particular if we take k to 
be e we have 

e* = exp (z In e) = exp z 

So that the function exp z can also be written as e*, exactly as for the 
real function. 'I he equations (14.29), (14.30), (14.31), and (14.27) can 
then be written in the form 

e---- = T • (■ 4 - 34 ) 

= {e*, >») = (cos y -f f sin _y) ) 

From these equations we can deduce that 
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gZTJi ^ glni ^ gH-rri _ _ _ j v 

e*-' = {e». i^} = {!,’ \n} = i - . . (14.35) 

gZ+2^t ^ gZ _ g2ni _ gZ 

The watchful reader will notice that the definition (14.33) 
where k is positive, agrees exactly with the old definition (6.15) when 
z is real. But it does not necessarily agree with the definition (14.32) 
when 71 is fractional. Thus 2* would mean the positive square root of 2 
if interpreted according to definition (i4.33)» but according to (14.32) 
it could mean either \/2 or — v2. This is an unfortunate ambiguity, 
but on any occasion in which such a symbol arises the context will 
usually make the meaning clear. 

14.23 Complex trigonometric functions 

In Section 6.14 the so-called “hyperbolic functions” cosh a: and 
sinh a: were defined as and — respectively. 

This definition can readily be extended to any number z, writing 

cosh z = \(e’ €'“-) ) 

sinh z = \ (e- — e““) i 

PROBLEM 

(1) Express cosh {x iy) and sinh (x iy) in terms of x and y. 

Now if we put .V = 0 in (14.34) we obtain the remarkable formula 

e* = cos y i sin y. 

Replacement of y by —y in this equation gives us 

= cos (—>■) + i sin (—y) 

= cos y — i sin y 

whence e~*'' — 2 cos v 

e' — e~* *' = 21 sin y 

i.e. cosy = (e"' -f e-*'')l2, siny = (e*'' — e-*'’)j2i 

FURTHDR PROBLEM 

(2) Using the construction for exp z = e‘ given in Section 14.20 
show that these results are true. 

These equations hold for all real values of y. It is tempting to 
generalize them to complex numbers, defining 

sr = -I-e-*=)/2, sin :r = (e'* — e“'*)/2/ . (14.37) 
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for all values of z. Since e’* and e-** are one-valued functions of z, 
it follows that cos z and sin z are also one-valued functions; and these 
definitions will agree with the usual ones when z is real. They can no 
longer be directly interpreted as trigonometric ratios: but we can use 
the geometric construction for an exponential given in Section (14.20) 
to find the values of sin z and cos z for any value of z. Another 
method is to use equation (14.34): 

e‘= == v> ^ ^(-v+rr) ^ (cos .V -f- / sin .v) 

= ^‘'(cos .V — / sin .v) 

whence 

cos z = cos (x + ty) = i (e‘* -j- e”**) 

= i (e~'' + ^*') cos X + if (e~'' — e") sin x 
= (cosh y . cos .v) — i (sinh y . sin .v) 

and similarly 

sin z — (e‘' — e~‘')!2i = (cosh y . sin x) + /' (sinh y . cos .v) 

and if we know the numerical values of x and y we can use these equa¬ 
tions to calculate cos {x -f- iy) and sin (.v 4- zy). 

Notice that equation (14.37) can be written as cos z = cosh iz\ 
sin z = (sinh iz)/t which shows how the trigonometric and hyperbolic 
functions are related. This explains the remarkable similarity of their 
properties, which seems very mysterious when we consider real numbers 
only. 

The remaining trigonometric functions can be defined as follows: 

tan z — (sin z)l(cos z) sec z = i/(cos x:) 

cot z = (cos 2r)/(sin z) cosec z -= i/(sin xr) 

exactly as in the real case, and similarly for the hyperbolic functions: 

tanh z = (sinh sr)/(cosh xj), etc. 


PROBLEMS 


(3) Find cos (1 4 - 1), sin (i 4 - i»r/). 


(4) Show that tan z = ; 


e** — 


for cot z, sec z and cosec z. 


i (e‘* 4- 


(5) Show that tan z = (tanh /xr)/i, and 
the remaining functions. 


and give similar formulas 
give similar formulas for 


Now the definitions cos z={e*‘-\-e-**)f2 and sin x?=(^‘*_e-'*)/2f 
can be used to deduce the properties of the general cosine and sine 
functions, and any properties so deduced will automatically hold for 
the ordinary real cosine and sine functions. That is the chief practical 
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importance of these definitions. We scarcely ever need them to calculate 
cos z or sin z for complex values of z\ but we can use them to deduce 
the properties of cos z and sin z^ and they are often easier to manipulate 
than the geometrical constructions of Chapter 5. 

For instance, we have immediately from the definitions (remem- 


that e 

,2W< _ 

g- 2 ni 

= I and = e-”^ = - 

-!)■ 

cos 

0 


(e® + e®)/2 == (i + i)/2 

= I 

sin 

0 


0 

II 

0^ 

1 

0 


cos 

77 


^ 1 - 

- l)/2 = ■ 

sin 

77 

= 

1 ■ 

-[-l])/2 

cos 

277 


(e^^* H- e-^"*)l2 = (i 4- 

H 

II 

cos 

(^ + 

277 -) = 

^gzi+ 2 iri g-zi- 2 iTi'^l 2 




(gi‘g2ffi _j_ g-zig- 2 itiy 2 





(e*^ 4. e~‘‘*)j2 = cos z 


cos 


it) = 

4 - 





g^zig-ni^j 2 





(— — e~‘*)l2 = — cos z 

( CCi^ STV 

fsin 


V* — g-* 

1 ^WO w f 

\Oili < 

V 2 / ^ V 

. 2( 



(«'= 

)*4-2e‘'«”*'+(e”*“)® {e* 



4 4 

= 4c'^e“*74 = 4 c'®~^74 = ^ 

(cos =)= = {f::)i±.2£'_:£2i±(f:2)! 

4 

= \ (e2‘-- + 2 4 - 

= f (2 cos 25 r -f- 2) = A (cos 2z -f- i) 

We also have directly from the definitions 

cos z i sin z — e*‘ 
cos z — i sin z = - . 

whence 

_ giz, g(zj _ a. I 2r,)(cos Zi + i sin jyg) 

= (cos iTi . cos 2:2—sin z^ . sin (cos z^ . sin ^rg+sin z^ 

and similarly 

= (cos S'! . cos 22’~sin Zi . sin Z2)—t (cos sr, . sin iTo+sin Zi 
whence 

cos (^1 + ^2) = 

= cos Zi . cos Z2 — sin . sin Z2 
sin (^1 + ^2) = — f-*(=>+=-*>]/2* 

= cos z^ . sin Z2 + sin Zi . cos arg. 


(14-38) 


cos a^a) 


cos Z2) 
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FURTHER PROBLEMS 

Using the definitions (14.37) prove the following relations: 

(6) cos {~z) = cos z. 

(7) tan (—s:) = — tan z. 

(8) cos (Jtt — z) = sin z. 

(9) sin {z + \tt) = cos z. 

(10) cos (ar, — Z2) = cos z^ . cos Z2 + sin z, . sin Z2- 

(11) sin Jtt = I. 

14.24 Complex limits 

Suppose that v = OP is a variable vector, and V = OQ a fixed 
one. Then it is natural to say that ‘V tends in the limit to V" if the 
point P tends to Q, i.e. if the distance PQ becomes smaller and smaller 
and tends to zero. Now the vector PQ = OQ — OP — V — v, so 
that the distance PQ is by definition \y ~ v\-, to say that ‘V tends 
to y” or that ”v-^y” will therefore mean by definition that | y —»»|-^o. 
Now \ y — I'l is not a vector but an ordinary positive number; so this 
definition reduces the idea of the limit of a vector to that of the limit 
of an ordinary number. There is no need to introduce any really new 
idea. 

Similarly a complex number z will be said to tend to a limit Z if 
\Z -- z\ -> o. 

It is now a fairly simple matter to show that these limits obey the 
laws we should expect. As an example we shall prove that if Zi a 
limit Z,, and a limit Z2, then Zi Z2 Z^ + Zg. For 

4 - Z2) - {z, 4 - ^2)1 = \{Z, - z,) + {Z2 - Z 2 )\ 

< \Zi — Z^\ 4 - 1^2 — Z2\ [by (14.19)] 

But (Z, — Zil o and IZ2 — Zzl -> o by hypothesis, and therefore 
|(Z| 4 - Z2) — (zi 4 - Z2)\, which is less than their sum, must also tend 
to o. Similarly (with a little more trouble) it can be shown, for instance, 
that ZiZ2 -*■ Z,Z2. 

> 4'25 Differentiation of complex functions 

Let w be a function of a complex number z; e.g. to = z^, or to = 
sin z. 

Then if a»j is the value of to when z = and Wg the value when 
^ = Zt, we shall call Bw = W2 ~ to, the “change in to”, Sz = Z2 ~ z, 
the “change in x:”, and the ratio Bto/Bz the “average rate of change of 
w”. If this ratio tends to a definite limit L as Sar ->■ o we shall say that 
“tt> is a differentiable function of z” and call L the “derivative”, 
“derivate” or “differential coefficient” of to with respect to z. We 
write L as dzo/dz, or D, to, or Dto, or tv,, or tv', exactly as for real variables 

DO ' 
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EXAMPLES 


(i) «; = ar. In this case iOi = Ziy — Sw — — 

— Zi = hzy and therefore hwjhz = i. But as ^ o the number i 
tends to the limit i, since it is a constant anyway. Therefore D^z = i. 


(2) — exp z. In this case = e'\ = e~^ = so 


that 


exp z 
Sto 


tv^ — io. = — e- 


— 2 


— 

= — 1) 


But when ^z is small, e^’ cs i + Sar (equation 14.29), so that 
Swc^e’^Sz. Thus SwjSz = e-, and in the limit we have D^io = 
— e-' = (where the suffix , is omitted as no longer necessary 
when z^ and z^ become indistinguishable). 

(3) w = Ln z. Here «), — Ln z^, W2 — Ln ^2 — («i “b “ 

Ln [z^ (i + Szlz^)] = Ln arj + Ln (i + Ssr/xr,), so that 

Sw = Wa — “'i = Ln (i + 5«/«i)- 

Now when hz is small, Bzjzj is small (provided that arj ^ o), and 
by the definition of the complex logarithm we know that one value of 
Ln (i + Ss/^i) ^zizi. (Here we should be a bit careful, since the 
logarithm has many values differing by multiples of 27 rt‘. But it goes 
almost without saying that in differentiating w — Ln z we shall take 
for W2 = 'Ln Z2 that value of the logarithm which differs only slightly 
from tVi = Ln z^, so that the difference Sw = Ln (i + Ss/si) will be 
a small quantity without any multiples of ztti added on. It must then 
be true that Ln (i + hzjz,) ^ Bzjz,). We have therefore Sw - Bzjzi, 
or hwi'Sz^ i/ui. The smaller Bz is, the more accurate this will be. 
and in the limit as 8c —> o we shall have 2 ). Ln z ~ i, c. 


Now all the usual rules for differentiation will hold, since the proofs 
already given for real variables can be translated almost word for word 
to apply to complex variables. So, for example, by the product rule, 

- D=(cc) -= (/)=c) c -i- XX {D^) = c + c = 2C 

By using the rules for products, sums, and functions of functions, we 
see that 

jD, cos z = D.[\ (e=' + e-"')] 

= lie=* - 

= — (e*' — e~-')l2i = — sin z 

and similarly D- sin z = cos c, just as in the real case. Any other 
function which can be built up by addition, subtraction, multiplication, 
division, exponentials, logarithms, sines and cosines can be differentiated 
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in the usual way: and that means almost all functions which occur 
in practice. We can go on to second, third, and further derivatives. 
We cannot however speak of “maxima” and “minima” of complex 
functions, since as a rule we cannot say of two complex numbers that 
one is greater than or less than the other. 

It is also possible to consider series of complex numbers, and the 
definitions of “convergence”, “geometric convergence” and the tests 
for convergence will be the same as for real series. In particular we 
can have a Taylor series 

w = {s — z^) + {z — ^,)7|2 4- . . . 

where the suffi.x (,) means that we have to put z = z^ after differentia¬ 
tion: and as a special case if = o we have 

to = Wq [to^] oZ + [to^g] o^V |2 4- . . . 

where the suffix („) means that we put z = o after differentiation. 
Now one could try to prove that this series converges and gives the 
correct answer by the same methods as for real variables. But for¬ 
tunately there is a general theorem which states in effect that if the 
function to can be differentiated once, to give to^ = D,to, then it can 
be repeatedly differentiated (to give to-^ — D.^to, etc.), and that 

the Taylor series must converge for sufficiently small values of to and 
must give the correct sum. In fact to be more precise there must either 
be a positive number R (depending on the particular function to and 
particular value z^ that we choose) such that the series is geometrically 
convergent when |x? — .s,| < /?, or else (as for sin ir, and cos ir) 
the series is geometrically convergent for all values of z. R is called 
the “radius of convergence”. This powerful theorem is unfortunately 
not very easy to prove. (Sec any standard text-book on complex 
variables.) But since all the functions met with in practice can be 
differentiated, this theorem implies that their Taylor series will be 
valid, and in particular this covers the geometric, exponential, logarith¬ 
mic series [for In (i 4 - ~)] and binomial theorem exactly in the form 
we have obtained for real numbers. I'hus 

e* = I 4- ;r 4- 2rV|2 4- 2rV|3 4- - . . 

In this series put firstly iz and secondly —iz in place of c: we get 

= I 4- iz — ^2/12 — *2^13 4- 2rV[4 4- . . . 

I — iz — ^2^12 4- *~3/|3 4- 5V|4 — . . . 
respectively, whence by addition and subtraction 

cos z = (e‘* 4- c~'*)/2 = I — ~2/j2 4- :yV|4 — ■ . . 
sin ;r = (f'* — e-^*)/2* = z ~ z'^j\S — . . . 

This shows how the cosine and sine series arise. 
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If the function is not differentiable when z — Si the Taylor series 
will fail. Thus ijz and Ln z cannot be differentiated when sr = o, since 
they are then infinite, and so they have no Taylor series. It is perhaps 
also worth noting that w = z, the complex conjugate of z, cannot be 
differentiated for any value of z according to our definition, and so 
will not be expressible as a Taylor series. For if z — x iy, then 
z = X ~ iy. Thus 

Bz=Zz~Zi=(x2-\-ty2)—(xi'\-iyi)^{x2—Xi)-\-i{yz—yi)^^x-\-iBy 

hz=Zi-Zi=ix2-iy2)—ixi—iy^)=(xi-Xi)—i(y2~yi)=Bx~tBy 

Bz/Bx = (Sx — iBy)l{Bx + *Sy), 

Now the essence of differentiability is that when Bz is small the 
quotient BwjBz will approximate to a certain limiting value, the deriva¬ 
tive. But one possible way of making Bz small is to take 8 a; small and 
= o exactly, when the formula above shows that Bzj Bz = 1. Another 
way is to take Bx = o and By small, when BzjBz = ~ i. So, however 
small Bz may be, the quotient Bz/Bz can take the values i and — i, as 
well as other possible values, and therefore does not approach any 
definite limit. It follows that any function which involves z, such as 
sin z or z- or \z\ = will not as a rule be differentiable, and cannot 

be expressed as a Taylor series. 


14.26 Integration of complex functions 

If a function W — F{z) has its derivative D^W — Fs{z) equal to a 
given function, w = f{z), then W is said to be an “indefinite integral” 
of zv, and we write 

W = J w dz. 


Since the rules for differentiation are the same as for a real variable it 
follow’s conversely that the rules for integration will be the same: e.g. 


f z dz = ^z^ -h C, 



Ln z C, and so on. It can be shown 


that if w is a differentiable function of z then it has an indefinite integral 
W= F{z) — f wdz. In fact, on taking the Taylor series for w, say 


ttj = Co -f- a\Z + floS* H- . . . , 


and integrating it we obtain 

C + a^z + aiSrV2 + a^z^jz + • • • • 

It is clear that if the series for W converges, then w’e obtain that for v) 
by differentiation, and therefore W is the indefinite integral of w. It 
is not difficult to show that the W series does converge for sufficiently 
small z*. 

* Actually the formal proof of this theorem to be found in more advanced books 
follows quite different lines. But the reasons for that need not concern us here. 
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This method allows us to integrate any function which is likely to 
arise in practice. One instance is the “normal integral” f “ dz 

V 27 r J 

which is of interest in statistical theory. By the exponential series 


= I - + 


and therefore on integration, 


^ .4_J 


412 


o|3 2 '\r 


2r 


+ • • • 


dz=C-hz~lz^ + — z^ — ... + + . . . 

6 40 2’^+Vjr 

The normal integral is obtained by multiplying this series by 
( 27 r)~J. The product is geometrically convergent for all values of z, 
and can therefore be used for calculation. 

It is also possible to define a complex definite integral y*' w dz 

= F{z^ — where F{z) = W is the indefinite integral. This 

again will obey all the usual rules for definite integrals. 

In short we see that almost every process used on real numbers is 
a particular case of a more general process concerning complex numbers. 
It is true that complex numbers as such rarely occur in practical prob¬ 
lems where all quantities normally occurring are real. (But complex 
numbers as such can be used to study vibrations, or in plane geometry.) 
Yet, even when all the quantities concerned are real, it can be helpful 
to use complex quantities as an intermediate stage in the calculation. 
We proceed to demonstrate this in the next chapter. 




CHAPTER FIFTEEN 


SOME USEFUL INTEGRALS 


15.1 Trigonometric integrals 

Certain integrals can be readily evaluated by the use of complex 
numbers. Many of these spring from the formula J dz ^ e^^jA 
+ C, which applies whether x is real or complex, provided that A is 
not zero. (If ^ = o the integral becomes J dx ~ x C.) 

Consider for example the integral of e® cos x. By formula ( 14 . 37 ), 


J e® cos X .dx^ J (e‘® + e-*) dx 

= \ f (eV® +6^6-^^) dx 

[e(i+0* 

^(1+0* ^ fO-O* ^ ^ 

2(T+T)'^2(i-0 



This is a valid answer, but rather awkward to use in this form 
owing to the presence of complex functions. But it can be reduced to 
a real form. To do this we first bring the fractions to a common 
denominator 2 ( 1 + f)(i — *) = 2 ( 1 ^ — (“) = 4- Thus 


/ 


c® cos 


.V dx = 


(i -f( i H-£)g<i-o^ ^ 

2(1 +/j(i -0 
= l[{i - i) + (I + 0 

= H(i - 0 + (I + 0 ^ 

= ie® (I - 1) + (I + i) e-®] + C 

= le' (e‘® + e-*®) — * (e*® — + C 

= [(2 cos .v) — i (21 sin .v)] + C 

= ie® [cos x + sin .v] 4- C 


This can be readily checked: on differentiating the right-hand side 
we do in fact get e® cos .v. 


PROBLEMS 

(1) Show similarly that f sin x . dx = ^e® [sin x — cos .v] + C. 

( 2 ) Find f cos z-v . dx. 
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Now consider J cos Ax . cos Bx . dx. 

Firstly we have from (14.37) 
cos Ax . cos Bx = i J 

_ J r I x^^Bi i ,j x y4i B* 

— J '^{A -hD)9 X ^g(A^B)i X x M +B)^ xj 

— J ^-^+B)ix _j_^-(^+B)i xj _j_ I ^^.4-B)rfx_j_^—<^-.B)ixj 

= 4 cos (.<d+ 5 )a:+ i cos (/i —. . (15.i) 

Now if K =j^ o, J" cos Kx . dx — (sin Kx)lK + C, as follows either 
from our standard formulas, or from 


/ 


cos 


Kx . dx = J i dx 

= + C 

= _ e-^'^yziK + C 

= (sin Kx)IK + C. 

Therefore if .<d # J 5 and ^96 — B we have 

r A D j sin (.'J+/?).v , sin(^— 5 ).v ^ 

f cos Ax . cos Bx.dx^ -^(A+Bf + * (A-R^ + 


2 {A —B) 


(* 5 - 2 ) 


The exceptional cases in which A = -i: B are not difficult. \f A = B 
9^ o then (16.1) gives us (cos AxY = ^ cos zAx + 4» so that in this 
case 

J' (cos Axydx —■ (sin zAx)!^ + xjz + C . (15.3) 

If ^ ^ 9A o we get the same result. Finally if A = B = o 

integral becomes J i dx = x + C. 

Similarly we find that if A ^ B and A ^ — B, 

sin {A +B)jic 


the 


r ■ A ■ i> J sin(/I — 5 )j: 

/ sm Ax . sin Bx . dx = . . _ - — - 

J zi^A —B) 

r ■ A tj I cos(.'I+fi)x 

/ sm yJ.v . cos Bx . dx — — / a : n\ 

J z{A +B) 


2 {A +B) 


+ C 


cos {A—B)x ^ 


z{A~B) 

ar.d the exceptional cases can be derived from the formulas (A ^ o): 

J (sin AxY dx = xjz — (sin zAx)l^ + C 
J' sin Ax . cos Ax . dx = — (cos zAx)/^A -\- C 

In general this method enables us to evaluate any integral of the 
form y" e'*'cos a: . cos . sin with any number of cosines 

and sines. 


(*5-4) 
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We can also integrate For since J we have 

by the “integration by parts** method 

f dx = x {K~^e^‘) - f dx 

= + C 

A further integration by parts gives us the integral 

f x^^^ dx = - 2K-^ + 2K-^e^=^ + C 

and in general if n is any positive integer, 

J dx 

= ic-"-* in . - . . . + (-i)"1 + c 

I- L \n |«—I |«—2 J 

(15-5) 

as can be verified by direct differentiation. This allows us to integrate 
expressions like x^ sin .v, x cos x, x cos x, and more generally of a 
form like cos B^x . cos B^v . sin B^x. 

EXAMPLE 

(i) Find J X cos Ax . dx. 

J X cos Ax , dx J dx 

^iz g-Ais ^ix g-Aix 

= *- UJ— +- TA^— 

= {.V sin Ax)!A + (cos Ax)IA^ + C . . . . (15*^) 

Similarly J x sin .-^.v . dx = — (a- cos Ax)/A + (sin Ax)IA^ + C. 


FURTHER PROBLEMS 

(3) Find J x^ sin a . dx. 

(4) Find J X sin a . cos a . dx. 

(5) Find J X cos a . dx. 

15.2 Vibrations 

Consider a body near a state of equilibrium. Denote its position 
measured from the equilibrium point by j (we use the term “position’* 
for convenience; but a similar analysis can be applied to any state of a 
system in equilibrium, e.g. y could mean temperature, or volume, or 
any other convenient property, provided that the equilibrium point is 
taken as the origin, y = o). 
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If the equilibrium is stable there will be a force tending to restore 
the system to equilibrium whenever it is slightly disturbed. The 
simplest assumption that we can make, and one that will often be nearly 
true in practice, is that the restoring force will be proportional to the 
displacement y. So the acceleration of the body will be proportional 
to^', but in the opposite direction, and we can write 

l>cy ==yit=- Ky 

where t stands for the time and is a positive constant. It will be 
convenient to denote by a>, so that this equation becomes 

yti=—^-y- • - ■ ( 15 - 7 ) 

Now it has been shown in Section (13.12) that the solution of this 
equation isy = p ^ cosojt + {pjio) sin lut, where />o ^nd p, are arbitrary 
constants. (More precisely we have shown that this is the only solution 
with a Taylor series about ti = o; in Section 16.26 it will be shown 
that there are no other solutions.) Since Pi is arbitrary', (Ptloj) is also 
an arbitrary constant, and we can equally well write this general solu¬ 
tion in the form 

y = a cos cut b sin cut . . . (15.8) 

where a and b arc arbitrary constants. 

(a = Po and h = Pj/cu). A motion of the 
form specified by (15.8) is known as 
“simple harmonic”. There is another 
way of writing it. Draw a right-angled 
triangle with sides a and h (Fig. iS-i)- 
Let A be the hypotenuse, and ^ the 
angle opposite b, so that A =■ 
a = A cos <f>, b -- A sin 'Fhen 

y =. 0 cos ojf -f- A sin of 

= A cos <f> . cos ojt -1- A sin <f> . sin u>t 
= A cos {uit — 4 >) . . . . . (t5-9) 

I f <^ = o this gives an equation y ■= A cos uit. 'Fhis means that when 
we plot y against t we obtain a wave-form for the graph, like that of 
cos t (Fig. 5.9). 'J'he constant A is the greatest value which y can take, 
and is called the “amplitude” of the vibration. If <{> is not zero the 
effect is to delay the vibration by a constant interval of time <^/a*, since 
ojt — ^ = oj{t — that is to say, the whole graph is shifted along 

the {-axis by the amount •i>jcu but otherwise unaffected. 'I'hc constant 
<t» is called the “phase”. 

When t is increased by z-njoi the displacement y returns to its former 
value, for then A cos [oj (/ -f- ztt/cu) — <fi] ~ A cos [lut — 0 - 1 - ztt] = 
A cos [ojt — 0 ]. The vibration has an exact period znjoi, or frequency 
oj/ztt. The constant tu obtained by multiplying the frequency by 2n 
is sometimes called the “pulsatance”. 



Fig. 15.1—Amplitude A and 
phase <i> of harmonic motion 
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Many mechanical vibrations are of this form: the waves on a pond, 
the vibrations of a tuning fork, the note of an organ. Now our calcula¬ 
tion depends on two assumptions. The first is that a displacement 
from equilibrium produces a restoring acceleration or second deriva¬ 
tive. This is true for mechanical oscillations. But 
it will not be true for a change in temperature, 
for which Newton’s law of cooling states that the 
displacement in temperature produces a propor¬ 
tional rate of change, or first derivative, tending 
to restore equilibrium. In that case the tem¬ 
perature difference falls off according to an 
exponential law', without oscillations. The second 
assumption is that the restoring acceleration is 
exactly proportional to the displacement, accord¬ 
ing to the law Consider here a 

pendulum of length L subject to a small dis¬ 
placement y' (Fig. 15.2). Then the angle 6 from 
the vertical is given by or 0 = yjL. If 

the mass of the bob is m, it is acted on by a 
downw'ard force mg, and the component of this 
perpendicular to the string is ~mg sin 6 , or 
~mgd approximately, since 6 is by hypothesis 
small. 7'he acceleration of the bob = force/mass 
= — mgdj'm = gd — — gy/L, the minus sign 
indicating that it is tow'ards the equilibrium position. That is, 

yu = — gy!^ = — <»'^y where W = VigJL). 

Thus, by our previous argument, the pendulum undergoes simple 
harmonic motion with period zn-m = 2TT\\L;g). (But if the displace¬ 
ment is large this approximation fails, and the motion is no longer of 
this type.) 

In many cases of periodic motion the proportionality bet^veen 
displacement and acceleration breaks down. When a note is played 



Fig. 15.2—Simple 
pendulum 



on a violin the string tends to stick to the bow for a certain time, and 
then return very rapidly to its former position. The motion w'ill be 
of the form shown graphically in Fig. 15.3, w'ith a straight ascent, and 
a sharp peak, but nevertheless exactly periodic. Now' when we listen 
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to the playing of a single note on a violin (or an organ, or a piano) we 
hear not only a note of period equal to the period of vibration, called 
the “fundamental”, but also “overtones” or “harmonics” with periods 
h i ... of the fundamental period, i.e. with frequencies exactly 
multiples of the fundamental frequency. Thus it seems that such a 
note can be thought of as made up of a mixture of the fundamental 
and overtones in varying proportions, and the human ear has a mech¬ 
anism for separating these out. 

Now it is fairly clear why the overtones or harmonics must have 
frequencies which are exact multiples of the fundamental. Suppose 
for instance that a note has an exact frequency of 500 cycles per second. 
Then a component of frequency 1000 or 1500 per second will neces¬ 
sarily repeat itself exactly every i/500th of a second, and so will fit 
into this period; but a component of 700 cycles per second could not 
do so. In general let us suppose that the fundamental note has fre¬ 
quency cu/arr. It is then represented by a, cos cot -r A, sin ojf, where 
Qi and bi are constants, and t stands for the time. A note of twice 
this frequency is represented by a., cos zojt -p 63 sin one of three 
times the frequency by cos 2 ^^ + and so on. So if the 

observed displacement y can be broken up into a combination of 
harmonics it must be of the form 

y = ^7 0 -|-a, cos to/ -f sin cuM-<?2 cos ziot -} b., sin 2ojt -f- . . . etc. 


We can readily separate these terms by using integral calculus. To 
simplify the calculations we shall take the unit of time to be such that 
<0=1, i.e. that the period is 27 t. (If not, this can be arranged by a 
suitable change of units.) Now suppose first that y can be analysed 
into a finite number N of vibrations: then we can write 


>» = ao-t-fli cossin / f . . . cos Nfb;^> sin A't . . (15.10) 


The device which solves the problem of finding the a's and b's is as 
follows. From formulas (15.2) to (15.4) above we see that, if m and 
n are zero or positive integers, but m ^ n, 



cos mx . cos ;/.v . dx = f cos mx . sin 7;.v . dx 

J 0 


/ Zrt 

SI 

0 


sin mx . sin nx . dx 


o 


since sin 2 n{m -1- //) = sin 27 t(w — n) — o and also cos 27 t(w 4- «) =: 
cos 2rr{m — w) I. Also if m n 'A o we have 


(cos nxy dx = (sin «.v)2 dx = it. 


r 

J 0 


COS nx . sin nx . dx — o. 
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The series (15.10) for y becomes, on multiplication throughout by 
cos 

y cos nx — cos tix a^ cos x . cos nx + sin x . cos nx + 

... + cos nx . cos nx + sin nx . cos + . . . 

+ cos Nx . cos nx 4 * b^ sin Nx . cos nx. 


On integrating this from o to Zn we shall therefore obtain a zero 
integi al from every term on the right-hand side except cos nx . cos 
which has the integral Thus (for n ^ o) 



y cos nx . dx = f^‘‘ On cos nx . cos nx . dx = nOn 
Jo Jo 



77“^ y . cos nx . dx 


(15.11) 


Similarly if we multiply equation (15.10) through by sin fix and 
integrate we shall find that every term will give a zero integral except 
b„ sin w.v . sin nx, which has integral 6„7r; 


i.e. 


f y sin nx . dx = f ^ bn sin nx . sin nx . dx = Tibn 
Jo Jo 

. (15.12) 


bn = 77“^ J~^ y sin nx . dx 


Finally to find Oq we simply integrate y as it stands from o to 277; 
every term but the first has a zero integral, and therefore 


y dx — a^dx — zna q 
J 0 ^ Jo 

i.e. flo = ( 27 t )“1 y .dx . . . (15.13) 

Thus each coefficient or b, can be expressed in the form of an 
integral. 

However, in general, a periodic wave-form y will not be expressible 
exactly as the sum of a finite number of harmonics. But it is natural 
to suppose that it can be expressed as the sum of a convergent infinite 
series: the first few terms would represent the fundamental and simplest 
harmonics, while the tail end of the series would consist of higher 
harmonics of small amplitude and negligible effect. 

y^«o-f <7, cos sin / -f a2 cos 2/+63 sin 2/ + . . . to infinity 

(15-14) 

It is also natural to suppose that the coefficients 77„ and b„ can be 
calculated from the integrals (15.11), (15.12) and (15.13). In fact 
these as.sumptions are true tor a very large class of functions y: this is 
known as Fourier’s I hcorem”. But unfortunately it requires con¬ 
siderable ingenuity to prove the result properly, as the series (i5.i4)» 
known as Fourier’s series, can often be very slowly convergent. We 
slial! simply have to accept the result for our purposes. 
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To take a concrete instance, let us consider the motion of a violin 
string as shown in Fig. 15.3. To simplify the calculations the graph 
will be slightly idealized by assuming that the return motion is instant¬ 
aneous, as suggested by the dotted lines. We then have the relation 
y = Kt for values of t from o to ztt, and therefore 

a„ = ( 277 )-* J‘‘" ydt = (277)-' f-^ Kt dt 

= ( 277 )-! 

Qn = V cos nt . dt = 7 r“* Kt . cos nt . dt 

y 0 y 0 

= 7r“^ K [m“^ t sin nt + cos nt]‘^‘" = o 
b„ = 7 r~^ y sin nt . dt — 77-* J'"'' Kt . sin 7 it . dt 
— 77“* K [— t COS nt n^^ sin == — zAT/i"* 


So y is represented by the series 

2 2 

y = K\ 7 t — 2 sin t — sin 2/ — - sin 3/ — . . —- sin r/ — . . .] 

3 

and the successive harmonics do in fact decrease in amplitude, but only 
very slowly. 

If the value of y is not given by an algebraic expression, but by a 
graph or set of numerical values instead, it is still possible to find 
and bf. by the use of numerical integration. A suitable way of doing 
this is suggested by the following interpretation of the integrals (15.11), 
(15.12) and (15.13). As t varies in value between o and 277, is the 
average value of y, a„ is twice the average value of y cos nt, and b„ 
is twice the average value of y sin nt. Imagine then the interval from 


o to 277 divided into twelve equal parts by points Ti = o, T^~ 

(= 3 o'’)» ^3 = ^ 7 - (= 60^). = ^7 9®^)' on up to 

o o 


7 T 

6 


(= 330*^). The final point T, 3 ~ 277 will simply repeat the first, 

since y is periodic, and can therefore be ignored. Let Fj, ¥», . . . 
1^12 be the corresponding observed values of y. Then Uq will be 

approximately the average value (F, + 4 - . . . + F12); a, 

will be approximately the doubled average ^ ( F, cos o® + F2 cos 30° 
4 - F3 cos 60® 4- . . . 4' Fjj cos 330®), and will be the doubled 
average {y 1 sin 0° + sin 30® 4 . • 4 F,™ sin 330®). will 



BIOMATHEMATICS 


414 

be twice the average of cos zT^) for values of r from i to 12, and 
so on. If greater accuracy is required the number of points must be 
increased, and various schemes have been worked out to reduce the 
labour of computation (see E. T. Whittaker & G. Robinson, The 
Calculus of Observations^ Blackie, 1937, Chapter 10). 

The ability of the human ear to separate a note into fundamental 
and overtones suggests that it acts in a similar way to a Fourier analyser. 
But comparisons between sounds as heard and as mechanically recorded 
show that it does not, or at least not very accurately. The fundamental 
pitch can sometimes be heard very clearly in a note which exact analysis 
shows to consist almost entirely of overtones: this is probably due to 
distortion of the sound in the auditory channels. 

It has been many times suggested that a Fourier analysis may be 
helpful in finding the causes of a vibration. This may be so in the 
study of musical notes, and in X-ray crystallography, or in tracking 
sea-waves, or even in analysing the components of an electrocardiogram. 
But it is doubtful whether it is helpful in studying biological periodic 
phenomena in general, since the mechanism which causes fluctuations 
in the number of a species from year to year, or causes the nervous 
shaking of the hand when held out, may not be at all analogous to the 
process which causes the swing of a pendulum. The decomposition 
of such a process into harmonic vibrations may form a simple means 
of description, and is always mathematically possible: but it can be 
irrelevant to the explanation. 


PROBLEMS 

(1) A periodic function y has the value o when t lies between o 
and 77 , and the value i when t lies between tt and 277, and thereafter 
repeats. What is its Fourier series? (This e.xample is not so artificial 
as it might seem at first sight. If a constant pressure is applied to a body 
at regular intei^’als, the behaviour of the body may depend on its 
resonance with the various harmonics. The same will apply to an 
electrical system transmitting a signal composed of a series of Morse 
dashes.) 

(2) A periodic functiony — f has the following values: 



1 

1 

1 C> 

7 T 

6 i 

1 

1 

277 ; 377 ■ 

6 1 o' ; 

417 

6 

1 

« 


777 

1 ^ 

8:t 

6^ 1 

t 

^ lOn 

1 V 

1 Iff 

6 

y 

1-3 

1*666 

1 

! -8 —5 

_ 1_ 

— I *1 

— •966 

—7 ! 

J 

— •466 

— •2 

— •1 ■ 

1 

' i 

-•I ! 

■366 


Find its components up to the terms in cos 3? and sin 3/. 
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15.3 Partial fractions 

A straightforward subtraction of —-— from —— shows that 

X i X ~ 2 

_^ _ I ^ (X i) — (x — 2) ^ _3 

X — 2 .V + I (at — 2 ){x + l) (.V — 2)(.V + l)* 

On division by 3 this can be written 


{x — 2){x + i) 3(.v — 2) 3(.v + O 

This result makes it possible to integrate ---- ; for by the 

^ ^ (.V - 2)(a: + i)’ ^ 

equation above 

r dx _ ^ dx _ ( dx 

J (x — 2j{x -i-T) ~ J 3(r - 2) J 

= J In (a: — 2) — i In (at + 1) + C. 

(If a: is complex the more general function Ln will be used instead of 


In). Similarly from the equation 


.V 4 - 2 AC + 4 (a: + 2)(a: + 4) 


we 


have ,-i. _I_ and therefore 

(r + 2 )(a; + 4) 2(a: + 2) 2(.v + 4) 


dx 


= i In (at 4- 2) — J In (a: 4- 4) 4 - C. 


J+ 2)(.v + 4) 

In general we find, provided that a ^ p, 

I _ I _ I 

{X — a)(A: — p) (a — p){x — a) (a — p){x — p) 

and this enables us to integrate the expression on the left. This is 
called a decomposition of--- into “partial fractions”. 

{X — a)(Ae — p) 

Now suppose that we have three factors, such as - - - *_ 

(a:-2)(ac—i)(a;4-i)* 

By taking two of these factors at a time we find the following decom¬ 
positions into partial fractions: 


(Ar‘- 2 )(^-Y) 


X -- 2 


X — I 


{x - 2 ){X 4 - I) six - 2) 3(a: + 1) 


(.V — i)(x 4- i) 2 (a: — i) 2(.v 4- 1 ) 




BIOMATHEMATICS 


416 

These formulas can be used to decompose the original three-factor 
expression, thus, 

{x—2){x—i){x-\-i) (x~2) [(A:-i)(a: + i)J 

(x—2) L2(a;— i) 2 (a:+i)J 


2(a:— 2)(ar—i) 2 (a:— 2)(Ar-l-i) 

^ \_x-2 ^ {.3(^-2) 3(^+0] 


^ ^ 


and so 


3(a:— 2 ) 2(a:— i) 6(a: + i) 


dx 


A similar argum.ent shows that anyfraction of the form,-——-——- 

ABC 

can be broken into partial fractions — - - - where A B 

X—a x—p X—y 

and C are constants, provided that a, jS, and y are all different. For 


I 

r _ _ ' 1 

I 

r ' ' 1 

(.v-a) 

i 

I 

(*-“) 

=r 

_ ' 1 

P-y[ 

1 

1 

1 

1 

i 


and each of the terms within the square bracket can be decomposed 
into partial fractions. It follows that we can also decompose an ex¬ 
pression like —(where a. y, 8 are all unequal). 


For if 


(A*-a)(.V-^)(.V-y)(.V-S) 
I 


A , B , C , 

+ ;-+ --c, then 


(.V-a)(.V-^)(.V—/) (.v-a) ‘ (.V-^) ' (.V-y) 

_.- _ + _ 

(.v-a)(A’-^)(.v-y)(.v-S) (.v-a)(.v-5) (a:-^)(jc— 8) (.V-y)(.V—8) 

and each of these terms can be split up, finally giving an expression of 

the form where A', B', C, and D’ are 

.v-a a:— j8 .V—y .v —8 

constants. Thus we find, for instance, 

_ 1__ ^ ^ ^ * 

(.V—2)(.v— i).v(a; + i) 6(.v—2) 2(a-— i) 2.V 6(.v + i) 
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The same sort of decomposition applies for any number of unequal 
factors. 

However, the expressions we wish to integrate in practice are more 
likely to be ones such as 

- 3.v2 + X - 3 ^ - 3^- + X - 3 

— 2x2 — X + 2 (jc_2)(:c_i)(:c + i) 

that is, the ratio of two polynomials. We shall try to reduce these to 
partial fractions. The obvious first step is to divide the numerator 
by the denominator as far as it will go: 


X + 2 


X® — 2x2 — X 4- 2 


) X^ + 0X2 __ 2X2 ^ _ 2 

X^ — 2x2 — x2 + 2.V 

2X® — 2X2 — — 2 


2x2 _ ^2 _ 2 a; 4- 4 


2 X2 _j_ ^ 

This gives a quotient (x 4 - 2) and remainder (2x2 4- x — 7), so that 

X* — 3x2 4- jc — 3 = (x2 - 2x2 — X 4- 2)(x 4-2)4- (2x2 + X — 7) or on 

= X 4- 2 4- 2^^+-v-7_ 

X® —2X2—X 4-2 1- ,y3_2x 2—x4-2' 


division by (.v®—2x2—x 4-2), 


It is therefore sufficient to concentrate attention on the expression 

2x^ -j- ^ 

3 —— -Now we have already reduced i/(x 2 — 2 x 2 —x4-2) = 

i/[(x—2)(x —i)(x 4 'i)] to partial fractions: so on multiplication by 
(2X24-X—7) we obtain 

_2.X24-X—7 _ 2 x2 4-X —7 2 x2 -{-X —7 

(x —2)(x—i)(x 4 -i) 3{x — z) 2(x—l) 


2x2H-x —7 


6 (x 4 'I) 

Each of these fractions can be reduced to a simpler form by dividing 
through by the denominators 

3(*-2) 3 + 3 + 

- 2 X - 3 -f. 4 

2(X —l) 2 2 2(X— 1 ) 

2X2 q-x—7 ^ 2 __ 1 6 

6(x-bi) 6^ 6 

and so by addition 


6(x4-i) 


2x2 4 -X—7 


4 - 


(x-2)(x —l)(x 4 -l) X-2 ^X-I 


BE 
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all the other terms cancelling out. This cancellation is no mere acci¬ 
dent: it happens in the general case, as can be seen from the following 
argument. We shall take for the purposes of illustration an expression 

like with three factors in the denominator: 

{x-~a){x-^)(x-y) 

but the form of argument is perfectly general. (Here the letters /, e, 
d, c, b, a stand for arbitrary numbers, and have no connection with 
the special uses of the letters e and d in connection with logarithms 
and the calculus.) As before, the first step will be to divide the numera¬ 
tor by the denominator, obtaining 

- (x-a){x-n^-Y) ^ {x~a)(x-^){x-Y) 


Here 4-^^:+A) stands for the quotient, and (c'.v- 4 -i'A^+< 2 ') for the 
remainder, which cannot contain any terms in .v® or higher powers (for 
if it did we could continue the division a stage further). We shall 
now forget about the term (hx^-\-jx-{-k) and consider only the re¬ 


mainder 


f'.v- -{-b'x+a 

{x-a)(x~^){x-Y)' 


We know that, if a, / 3 , y are all distinct, 


i/[(.v —a)(.v—j 3 )(.v—y)] can be decomposed into partial fractions of 
the form Ai{x—a)-\-BI{x~^)+Cj{x—y), where A, B, and C are 
calculable constants. Therefore 


f'.Y"4 b'xAa' 
{x-a){x-^){x-y) 


A{c‘x'^Ab'x 4 -g') 

.V —a 


-t- 


B{c'x'-\-h'X'\-a') 

x—^ 


^ C{c'x~+b'x-\-a') 
x—y 



Now suppose that when we divide A(c'x' 4 '^'*v 4 -<j') by (.v—a) we obtain 
a quotient Qi(.v) (which will be a polynomial in at) and a remainder 
7 ?, (a single number). Similarly let the division of B(c'x^~\~b'x-{-a') 
by (.V—give quotient 020''*) remainder ^2» division of 

C{c\\-- h'x ^a') by (.v—y) give quotient J23 (.v) and remainder R^- 
'rhcn (15.15) can be written 


' <> 1 I ' f 

c .V- -* b x-a 

(.V -a)(.V-/i)(.Y-y) 


= QM 


+ 5.(-v) + 


X — a 


.V—^ 


+ ? 3 (*^) + 


R 


x—y 


= l2(-v) + 


— + —\ + 
x — a X—p 


R 


(1516) 


where P(.y) -- P,(.v) 4 - p2(.v) 4 - p3(A') = a polynomial in x. This 
equation will be true for all values of .v except .v = a, .y = /S, and x = y, 
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when some of ihe denominators become zero. Let us multiply it 
through by {x — a)(x — ^){x — y); it becomes 

c‘x--\-b'x-\-a = 0(x){x—a){x—^){x—y) + R^(x~^)(x~y) 

+ —a)(.v —y) + R3{x~a){x—^) 

This again will be true for all values of x except possibly x = a, x — 
and X = y. But it is a relation between polynomials, and this implies 
(Section 3.10) that the polynomials must be identically equal, and must 
in fact reduce to the same expression c'x--\-b'x-\-a' on simplification. 
In turn this implies that Q(x) = 0; for if (^(x) was different from o 
it would introduce terms in x^ or higher powers of .v, and such terms 
do not occur on the left-hand side, or anywhere else on the right-hand 
side. So putting Q{x) — o in (15.16) we have 

c'x^+b'x+a’ _ ^1 , R* 4- 

(JC —a)(.V—)9)(.V —y) X — a .V—^ -V —y " ' 

showing that any expression of this form can be broken into partial 
fractions. 

In theory this argument not only shows that the decomposition 
(15.17) is possible, but also finds the values of /?,, R^, and R^. But 
in practice there is a short cut once it is known that the decomposition 
is valid. To illustrate this point we return to the example 


2 x- -f-x —7 


(.V-2)(.X-l)(.V + l) 


and write 


_2xM .v-7 = ^ 

{x-2){x-l){x-\-l) Ar-2 .V-I .v-f I * ■ J 

where A, B, and C arc three constants whose value we wish to find. 
Multiply this equation through by (at —2); it becomes 


2X^'-\-X-7 ^ J , B(x-2) C{x~2) 

(A.—l)(.V-j-l) 1 


(* 5 * 9 ) 


Now put a: 2; the terms containing B and C vanish, and we arc left 


with 


2(22)-|-2—7 ^ 

(2--l)(2-(-l) 


Similarly on multiplying (15.18) through by (a: — 1 ) we get 

2x^-\-x-7 C(x~l) 

{x- 2 ){x~^ I) a:-2 ^ irT 7 ~ 

and putting a: - i this becomes 2 B\ also multiplication of (15.18) 
by {x + i) and setting jc = — i gives — i C, so that finally 

—7 ^ _i 2 _ 1 

{x -2)(.v-i)(a; + i) x-2 X — I ;rh“i 
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exactly as before. However, as the reader may have noticed, the 
reasoning is strictly speaking fallacious, although it has given the right 
answer. The equation (1518) we began with will be true only if a: is 
not equal to 2, i, or — i, for these values give zero denominators: so 
the equation (15-19) derived from it cannot immediately be regarded 
as necessarily true if x = 2, which is the value we substituted to find A. 
However, this logical gap is easily bridged. Although {15.19) has not 
yet been proved to hold when x = 2, it will certainly be true for all 
values of x near 2, except possibly the value 2 itself. Therefore the 
limit of the left-hand side as a: ^ 2 must be equal to the limit of the 
right-hand side (for they are limits of the same quantity), i.e. 

■ ^ ^ + lim fc?) + lim 

y_, 2 (X — i)(a:-|-i) X—*-2 x-»2 X —I X—^2 AJ-j-I 

But the limit of the left-hand side is simply its value when a; = 2; the 
limit of A is Ay and the limits of the other two terms are equal to their 
values when x — 2, that is, zero. Thus our procedure is completely 
justified: we merely have to regard it as taking the limit as a: 2 
instead of the actual value when x = 2. 


EXAMPLE 


(i) Decompose 


2 A--^3 


(.v-ri)(.v-h3) 


into partial fractions. 


Let 


2 X--f 3 _ _j_ ^ 

(.V -f -1 )(X 4 - 3 ) a: -h I .V 4-3 


Multiply by (.v4- i) to get-? = A letting x 

•v 4-3 a: 4-3 

we have A — 

Multiply bv (■v+3) to get q. jetting ^ 

.X 4 -1 .V 4 -1 

we have 5 = II. 


—I 


-3 


PROBLEMS 

Decompose the following into partial fractions: 

( 1 ) 2.v/[(.v-i)(a- + i)]. 

( 2 ) (3-v4-2)/[(a-i)(.v~2)(.v-3)]. 

(3) (4.v=4-2.v-i4)/[(.v4-i)Gv-i)(.y-3)]. 

(4) ( 7 -v+ 8 )/[(.v-i)(.v-f 2 )(.Y-f 5 )]. 
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X5.4 . Partial fractions with repeated factors 

If a fraction has a denominator with repeated factors, such as 
i/(x~2y(x-\-i), the technique has to be modified. 

Since 

I _ I I 

(x^-2){x + i) ~ 3{x-2) 3(^ + 0 


we can split up the fraction ;-^,—7-r as follows; 

^ ^ {x- 2 nx-^i) 


(x~ 2 nx+i) 


lx-2) [3(^-2) 3(^ + oJ 

“ six-2)^ 3 -Ti) J 


= * _ _ ^ -f- i_ ( \ 

3{x—2y 9(.v—2) 9(j>: + i) ^*5-20) 

This is as far as we can go: for we cannot split ^—• into fractions 

3(^-2)" 

A B 

of the form-+-, for two such fractions are equivalent to one 

X ~-2 x —2 ^ 

single fraction which is quite different from — . 

x—2 3{x-2y 


By the use of (15.20) more complicated expressions can be dealt 
with; for example 

(x—2)3 (.t + i) “ .V —2 —2)=^(jc + i)J 

V„] 

3(jc-2p 9(^-2)“ 9(Ar-2)(A;-t-0 

= - _ i + i_ _ ^ 

3 (^- 2 )® 9 ix~ 2 )^ 2 y{x- 2 ) 27 (a* + i) 

ix~ 2 y(x-\-i)^ [(JC— 2 ) 2 (a; + i)J i+7 

“ 3C^2)"{^ + 0 “ 9(a;-2)(:c-)T) 9{x + iy 




3 (a:— 2)2 9 (.v 


^ 9 (^ 
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9(x-2y 2l(x-2) 


+ 


27 (a:+i) 27 (a:— 2 ) 

I I 


+ 


2a; + i 


9 (»- 2 )“ 
2 ^ + 1 


27 (*- 2 ) 
2a:4-i 


+ 


27(2:+1) 

2 I 


+ 


9 (af-M) 


2 


(*-2)^(* + l) 


3(*-2)“ 9(*-2) 

I 2;C + I I 

9 




27 {* + l) 

2X-\-I 


9(x+iy 


3{x-2) ’ X-2 


9(^+0 

2a:4-i _j_ I 

X—2 9 


2Ar4-i 

a;+i 


= -T^h+ —1+T"- 4-1 

3 (*- 2 )L *-2j 9 L *-2j 9 L * + iJ 


_ S 


3 (*- 2 )‘ 
_ S 


+ p__ 5 l_^_ 

L3 9J *-2 


9(* + i) 


+ _i_ 

3(*-2)“ 9(*-2) 9(* + i) 

We see, therefore, that whenever there is a repeated factor (a; — a) 
in the denominator there will be partial fractions of the form 




4 


A 


X—a (.V—a)2 

P{x) _ A, 


4 


A 


4 


{x~ay 

A, 


4 • • ♦ 4 


A 


n 


{x-ar * 


e.g, 


4 


A 


4 




B 


—T --4 


(»—i)®(x 4 i)*(Af 42 ) a;~i (.V—i)* (*—i)® ^ 4 i (^ 4 i)® x+2 

where P{x) is any polynomial not containing powers of x higher than 
x^ (or in general being of a lower degree than the denominator). The 
proof will follow lines very similar to that for distinct factors. If two 
different factors, such as (.v — a) and (a: — / 9 ), occur in the same 

denominator, we can separate them by using the identity ;- - -5^ 

(ap— f^){x — p) 

= —„JL-- — --1-; and by repeated application of this 

(a--p)(x—a) {a—p){x—p) 

the expression can be reduced to fractions of the form P(x)/(je—a)", 
where {*—a) is one of the factors and P{x) is some polynomial. We now 
divide P(x) by (a;— a), to give the quotient Pi(ap), say, and remainder 
Riy so that 

P{x) _ P,{x)ix-a)R, 


{x-ar 


(*-“)” 

PM ,_ 

(*—a)"-‘ (*—a)"' 
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P,(x) is now in turn divided by (a: — a), bringing the fraction into 

- 4 - — ' : continuing in this way we 

^ (x-aY 


the form 


(x — a )"”2 ‘ (a* —a)"”^ ' (.V —a) 

reduce the whole expression to a sum of partial fractions of the type 
we require, together with an extra polynomial. As before, this extra 
polynomial can be shown to be zero when the numerator of the original 
fraction is of lower degree than the denominator. 

Thus, for example, the decomposition - “jy partial frac¬ 

tions can be performed as follows: 

a:=-|-3.v-|-i _ (j:^+a:+ 4)(jc —i)+ 5 

■ (*v-ir 


{x-iy 


^“+■^-±+4- 5 

(.V—i)® (x —l)‘ 

(.v+2)(jc —i)-t-6 5 

{x-iy ^{x-iy ^{x-iy 


— .1.-1. 4- 1 4- _ ?_ 

AT-I {x-iy ^{x-iy‘ 


The problem now arises: can we shorten the calculations, as we did 


for unequal factors? 


Take, for example, : 

(a: — 1 )=*^; 


wc know this has the 


decomposition 

( 2 A:^- 3 ) ^ 4. . ^3 . I 

(.V —i)^v a: —1 {x — iy (.V —i)® .V 


The value of B can be found by multiplying through by .v: 

( 2 a:+ 3 ) ^ , ^3^ , fy 

{x~iy x-i (x-iy (x-iy ^ 


and then letting .v tend to o: this gives B = —3. Similarly on multi¬ 
plying by (x —1)=* and letting a: -> 1 we find = 5. Hut this method 
will not give the values of > 1 , and A 2, and we need a further stratagem. 
The simplest one is that of substituting particular values of .v in equa¬ 
tion (15.21). If wc put a: = 2 we find 

^ = At A- A 2 -\^ A ^ -\- IB, 

AI A~ A2 = I — A:i — A/i — o 


i.e. 
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If we put a: = 3 we find 

f = 1 + 2 + 3 

i.e. + i^2 = i - i^3 - f 

These two equations for Ai and A 2 can be readily solved to give 
= 3 . ^2 = — 3 » so that 

+3 _ 3 . 5 _3 

(jc— i)3ac a:—I (*—i)^ ^ 

Any expression thus decomposed into partial fractions can be readily 
integrated: 


f {2x-i'2)dx 1 

■ 3rfx 

3dx , 

' 5^^ 

■ 3^ 

J {x-iyx J 

X —I 

V 

{x-iT- ^ 

1 J 

X 


= 3 In (a;-i)+ 3 /(a;— i)— 5 / 2 (x-i) 2—3 In x+C. 


PROBLEMS 


Find the following integrals: 

(I) [ . (2) [{ 1 +^ (,) f 

(4) f _ ^ _ u) f (-^ +3« +5)^» 

J(* + i)V+4) J (* + 0‘ 


)V+4) 


, , r (zx+i)tix 

J(*- 2 )=(* + ,) 


(*+i)‘ 


15.5 Complex roots 

The method we have explained for decomposition into partial 
fractions can only be used if we can factorize the original denominator. 
In Section 3.10 we discovered a theorem which helps us to do this: 
it states that the polynomial 

P(.r) — A -{■ Bx Cx^ + . . . + Hx^ 

has a factor (x — a) if and only if a is a root of the equation P(Ar) = o, 
i.e. if and only if 

A + Ba + Ca* + . . . + /fa" = O. 

Now if the numbers used are restricted to real values there may be 
no such roots: e.g. the equation i + a:* = o has no real roots, and 
therefore i + a:- has no real factors. But there are two complex roots, 
X = i and x = —i, and therefore two complex factors of i + namely, 
X — i and .v + i. Similarly we have found that the equation x^ — i, or 
x^ — 1=0, has two complex roots x ~ <v and x ~ in addition to 
its real root x = i, so that x^ ~ i has the factors (a: — i), (x — w) and 
(x — w^). This is a particular example of a general theorem, that an 
equation of the «th degree 

A + Bx + Cx^ + . . . + i/x" = 0 {H ^ o) 
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has always at least one root, which may be complex; and as a rule it 
has n roots. This is true whether B, C , , . H are real or complex. 

explanation of this theorem is perhaps best illustrated by con¬ 
sidering a special case, such as a cubic equation 

A + Bx Dx^ = o 

where x can represent either a real or complex number. D ^ o, for 
otherwise the equation would not be cubic. Now it will be convenient 
to consider the special case Z) = i, i.e. to take the equation in the stand¬ 
ard form 

A Bx Cx^ x^ = o . . . (15.22) 

Since D o the equation can be reduced to this form by division by 
from A + Bx -h Cx^ -j- Dx^ = o we obtain (AID) -h {B!D)x -f 
-f-.X® = o which is of the required form (15.22) with AID^ 
BjD^ CjD in place of A, B, C respectively. 

Now let us put y — A Bx + + x^. Suppose that the com- 

plex number x is represented by a vector OP, and ^ by a vector OQ 
(Fig. 15.4). (Note that x and y represent distinct numbers here, not 



the real and imaginary parts of a single complex number as in the 
last chapter.) Thus to each point P there corresponds a number a:* 
to each number .v there corresponds y = A Bx Cx- -f and 
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to each y there corresponds a vector OQ, and therefore a point Q. In 
short we can say that P represents and Q represents y. 

Now imagine that P is moved to trace out a continuous curve G\ 
then Qy being determined in position by the position of P, will move 
so as to trace out another continuous curve G'. In particular we shall 
imagine P to move round a circle G of centre O and radius R. What 
can we then say about G' ? 

First let us suppose that R = |a:| is large. Then we know that in 
the expression y A + 5a: + Cx^ -h the term Ar> will far out¬ 
weigh the others. That means that y will be nearly equal to a :^, in the 
sense that the difference between y ~ A Bx C.v^ + x® and y^ 
(say) = x^ will be negligible in comparison with their absolute magni¬ 
tudes. Let y" be represented by a vector OQ": then will not be 
far from Q, and as P traces out the circle G, Q" will trace out a curve 
G" not very different from the curve G' traced out by Q. 

Now since P lies on a circle of radius 5 , the vector a : = OP can be 
written { 5 , 6 }: and as P traces out the circle, 6 will change con¬ 
tinuously in value from o to 27 t. But y" = x^ — {R, 
consequently 3 9 changes in value from o to 677, and the point Q" 
representing y" will run three times completely round a circle G" of 
centre O and radius 5 ®. (This circle is shown by the dotted line in 
Fig. 15.4.) It follows that Q, which always lies relatively near Q", 
w’ill trace out a loop G', nearly circular in shape, and encircling the 
origin three complete times before returning to the starting point. 

'I'he situation we have described w'ill hold when R is large. But 
whatever the radius R may be there will be a circle G, traced out by 
the point P, and a corresponding loop G' traced out by Q. Suppose 
we start with a very large circle G, and imagine it gradually shrunk 
until finally its radius becomes zero and it coincides with the origin O. 
Then the loop G' will also shrink, rather like a loop of string being 
tightened up. This shrinkage may not be uniform in all parts of the 
loop: indeed it is conceivable that some parts of G' may move out¬ 
wards for a time during the process—again very much as a loop of string 
can move about as it is being tightened. But in the end G must contract 
to the single point A^, where the vector OA^ represents the coefficient 
A w’hich is the value of y w-hen x = o. 

We now have two cases to consider: 

(i) It may happen that A = o. In that case the equation y = o 
becomes r ^ Bx H- Cv* -4- .v® = o, and this certainly has the root 
.V = o, and the theorem is therefore true. 

(ii) The other case is that A ^ o. But A = the vector 0 . 4 1, so 
that the point A^ is different from O. Now' on inspection of Fig. 15.4 
we see that it is inconceivable that the loop G' could shrink to the single 
point A I w ithout passing at least once over O. In fact, since G' is a 
triple loop, we shall expect it to pass three times over O during the 
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shrinking process: but in exceptional cases all three loops might pass 
over together at the same point and at the same moment. In saying 
that the contrary is inconceivable we are appealing to our common 
experience of strings and loops: but it is possible to give a strict 
though quite complicated mathematical proof of this point. Now G' 
is by definition the loop traced out by Q, where OQ is the vector repre¬ 
senting = A Bx + Cx^ + So if G' passes over O, there must 
be some point Q = that is to say, there must be some value of 
y = OQ = o. The corresponding value of .v is therefore a root of the 
equation y = o. If the loop G' passes at least 3 times over O, as will 
usually be the case, then there will be at least 3 corresponding roots 
of the equation y = o. But a cubic equation cannot have more than 
3 roots (Section 3.10), so usually it will have 3, and always at least 
I root. 

By replacing the number “3” by "n" throughout we obtain a similar 
proof of the theorem for the «th degree equation 

y = A + Bx -h C.v* -f- .v" — o. 


Returning to the cubic equation, let a be a root. Then y = -f- 
Bx -f Cx^ - 4 -a:® must be divisible exactly by .v — a; let us say 
y = (x — + B'x + x^). But the equation A' + B'x + .v- = o 

must in turn have at least one root which may or may not be different 
from a, and therefore A' B’x + -v® = (a: — ^)(A" H~ at), and y — 
(x — a)(A: — ^)(A" -f x). Finally the equation A" -j- x = o has the root 
y = — A"^ and so we can write 

y = A + -f- Cx^ 4 jc -3 = (.V — a)(x — P){X — y) 

i.e. any cubic expression can be completely factorized into linear factors. 
The same sort of argument holds equally for an wth-degree polynomial 

A+BxACx'^+ . . . = (x —aXx —^X^* —y) • • ■ —(15-23) 

there being n factors in all, though not all the factors need be unequal. 
If the coefficient of x" = // is not unity, then we shall have 

A-\-Bx-\-Cx'^-[- . . . 4 //.v" = H{x—a.){x—fi) . . . (x —A) (15.24) 

Thus any polynomial can in theory be completely factorized. It may 
not be very easy in practice to find these factors: that is a point we shall 
discuss later. But it follows that any “rational function” or ratio of 
2 polynomials 


/(x) = (a + bx 4 rx* 4 . . . + kx”')l{A 4 Bx 4 Cx® 4- - . 4 //x") 
can be expressed in the form ^x4-rx 4 . . • 4-^' 

It can therefore be decomposed into partial fractions by the methods 
we have explained above, and so we can find J f{x) dx. 
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PROBLEMS 

Integrate (i) j (-) j ) 

r +^) ^ 

J (x+iyix-iy 

15.6 Real factors 

We have now obtained in theory a general solution of the problem 
of integrating any rational function, i.e. the ratio of any two poly¬ 
nomials. In fact, this solution is not only a theoretical one, it is also 
quite practicable. But when we wish to integrate a real expression it 
may be rather inconvenient to have to manipulate a number of complex 
roots of an equation, and it may be useful to rearrange the results in 
a form involving real numbers only. 

Suppose that y = P(a;) — A -h Pjc + C.v® + • . + .v" is the poly¬ 
nomial we wish to factorize, where the coefficients A, B, C . . . are all 
real. If all the roots of the equation P(x) = o are real, the previous 
section gives us a real factorization. If not, let be a non-real root: 
then P{x) will have the non-real factor x — aj. It was, however, shown 
in Section 14.18 that the complex conjugate Si will also be a root, and 
P(.v) will have the factor x — Si. This is different from x — aj, since 
a, is not real, and therefore P(.v) has both factors (a: — aj) and {x — Si), 
so that 

P(.v) = (.V — ai)(.v — Si) Pi (:v) 

where Pi(-v) is the quotient, a polynomial of degree 2 less than that of 
P(.v). Let us write ai as the sum -f ib^ of real and imaginary parts; 
by definition Sj = <2i — ib^ and 

(.V — a,)(.v — S,) = (.V — f7, ~ ibi){x —Ml + f^i) 

= .v= — 2a iX + + *1^) 

= X- — za^x 4- Cl (say) 

where Ci = fli^ + 61- — lai|“. So 

P(.v) = (.v2 — 2aiX + c,) P, (.v), or 
P,(.v) PGv)/(.V" — Zfli-V Cl). 

But the numbers and Ci are real, and P(.v) has real coefficients: the 
polynomial P|(.v) obtained by division must therefore also have real 
coefficients. \Ve can now repeat the argument with Pi: if it has a 
non-real factor (a: — a^) = (a: — <22 — *^2) it will also be divisible by 
(.v — S2) = (.V — ^2 + *^2) therefore by (*v — a 2){.t — a 2) = 

X- — 2a..x Cj. Continuing in this way we can reduce any real 
polynomial to real factors: 

P(.v) -- (.V“— 2fl..v—c.)(-V“—2«.vV—Co) . . . (.V—/ 9 ,)(.v—^2) • • (15*25) 
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where the quadratic expressions {x^ — 2aiX — rj, {x^ — 20^ — c2). 
etc., are obtained by combining pairs of complex conjugate factors, 
while the linear expressions (a: — ^1), {x — ^2), etc., are obtained from 
the real roots of the equation P(x) = o. These factors need not be all 
different. A quadratic factor can be repeated: we might have 

x^ — 2aiX ^ Ci = x^ — 2a2X — c^; 

and the same holds for linear factors. 

Thus since the roots of the equation x® = i are oi, to® and i [for 
Zi = to®! the polynomial x® — i can be factorized in complex form as 
(^x — <o)(x — (o®)(:c — i), or in real form as (x® + a: + i)(.v — i). 


15.7 Partial fractions with quadratic factors 

Since two complex factors can be combined into a single real 
(quadratic) factor, it will be natural to do the same with the corres¬ 
ponding partial fractions. Thus 

I _ ‘"4. to® _ 1 

jc®—I 3(^~0 

by the ordinary decomposition. Addition of the first two partial fractions 
gives 

CO I to® _ a)(.V—to®) -|-co®(x—to) 

3(x—to) 3 («—to®) 3(x—to)(.v—to®) 


—X—2 


Similarly 


2X 


2X 


3(x®-h.v-hi) 


I 4-1 


+ 


I —t 


X®-1-x®-i-x 4 -1 (x 4 -i)(a; 4 ' 0 (-^— 0 2 (x 4 -i) 2{x-i) 

4 

and by adding together the first two fractions we obtain 

2x _ x 4 -i _i_ 

X® 4 -x®^x 4-1 X® 4-1 X 4-1 


.v-hi 


In general two conjugate fractions will give 

^'1 _ «i) + ^ BiX 4 -C i 

X —a, X—ai —«,) .V® —2t2ix4-tri 

where Bi and Cj are constants. The numerator now contains a term 
in X as well as a constant term. Thus if it is required to find the decom¬ 
position of i/(x® — 1) without bringing in complex quantities it is 
necessary to write 

_^ j?x4-C D 

(x® 4 - ^ 4 -1 )(^ — 0 4-x 4 -1 


X® —I 


X—I 
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By multiplying through by (j: — i) and letting x tend to i we find 
D = ^. The coefficients B and C must be found by taking particular 
values of x. By taking a: = o we find — i = C — whence C = §. 
By setting x = —i we obtain —^ = (—B + C) — ^D, whence 
5 = — Thus the decomposition is 

I ^ _ X+2 I 

a;*—I 3 (a;’*+a: + i) 3 (.v—i) 

A repeated quadratic factor must be dealt with as follows. Let 

2 a:^-|- 6 ji:^+ 8 .v®- 1 - 8 a:“+ 3 a: _ Bi.v+Ci B2X +C2 ^ 3 A:+C .3 

(x—i)(A; + i)(.v==+-v 4 -ij^ (.v^+.v + i)^ (a: 2 +a;-(-i)^ 

D E 


+ 


+ 


x—i a?4-i 

We can find D by multiplication by {x — 1) and taking the limit as 
.V -> I: this gives D = \. Similarly E = —The remaining co¬ 
efficients must be found by taking special values for x; this is rather 
tiresome but straightforward and finally gives Bi — o, Cj = —i, 
B2 = i» C2 = I, B2 = o, C3 = I. 

We now have to face the problem of integrating an expression of 
the form (Bx 4- C)i{x’ — 2ax + c)". To illustrate the process, con¬ 
sider the particular case f “complete 

the square” in the denominator. Here (.v® — .v + ^) == (.v — i)- 4- (i)“: 
in general x^ — 2ax 4 - = (a* — aY 4 - (c — or) ~ (-v — a)“ 4- 

where b — ^/'(c — a-), {b- = c — will always be positive, since 

x- — 2ax 4 - c has non-real factors.) 

Now change the variable from x to X ~ x — a ; the integral becomes 

Bx-\-C ( B{\-\-a)-\-C dx ,y 

(.v^-^2^v4-7)'‘ j dX 


I 


r BX+c 

J (X^ 4-62)" 


where C — Bo -{- C; and in our particular example, AT = .v — i, 

dx _ f AT4 -1 . Y 

(x^x+^Y ~ J (x^TW 


I 


=1 


XdX 


4 - 


I 


%dX 


(A’^ 4 - 1)2 ‘ J {X^^IY 

The first integral is easily dealt with by the substitution t — X- 4 * 
for then dtjdX = 2A, i.e. X — ^dtjdX, and the integral becomes 

IdtjdX . dX _ f dt 




J 2 t'- 


4-a: 


= - \{X^-^\)-^ +K = _i(.v 2 _.v 4 -«-^ 4 -A:, 
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where K is the constant of integration. (We cannot use the customary 
letter C, as it has already been used in another connection.) In general 

r BX dx 

the integral 1 - ^ can be evaluated by substituting -f ^2 _ 




4 dX 


it then becomes J dt. 

The second integral will be J 
If n = I this can be directly integrated, for 


, or in general 


C'dX 

(A' 2 + 62 )'-' 


i 


= C'6-1 tan-i {Xlh) 


by a standard formula. If n > i then we invoke the following reduction 
formula : 

{ dX _ X ( 2 «- 3 ) f dX 

J {X^^b^Y i2.{n-i)b%X-^^b^Y-'^ 2(«-i>= J {X^~+b-Y-^ 

■ ■ (15-25) 

which is easily checked by differentiation. This expresses the integral 
j (X^Wr integral j and by repeated 

use of the formula we can finally simplify it to | 
known. Thus in our case « — 2. 

UX 


_dX 

(A: 2 + 62 ) 


which is 


i 




. + --- 


i. I r 

• f J 


{X^+iY 2 . f. (X^ + i) 2 

= 4- 6 tan~^ (2 a: —i) 

x^-x + h 


_dX 
X^ -f- i 


%dX 


An addition of the integrals j j gives 

f + _ 


_‘_+ 4 - 6 tan-* (2.V-1) + K 

2X^—2X-\-l JC*—JC + i 


_ 3^—2 


4 - 6 tan-* (2jc —i) 4 - AT 


x^-x-^^ 

Any other real rational function can be similarly dealt with. 
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PROBLEMS 


Integrate the following functions: 


(0 


( 4 ) 


(*+ 3 )(*- 4 ) 


+1)- 


( 2 ) 


( 5 ) 


(*'+ 4 )(*+ 3 ) 

2X—4 

(A:2 + i)(A’2H-a; + i) 


( 3 ) 


x -\-3 


(x^+4)(x2 + i) 


15.8 Rational functions of sines and cosines 

Any rational function of sin x and cos x, as, for example, [i 4- sin x] — 
[cos X + sin .v] or [(cos x)- + 3(sin j:)®] / [sin jc + i], can always be 
integrated. The general method is to make the substitution t — tan ^x. 
Then sin x — 2il(i +/^), cos x = {1 — and dtjdx ~ 

2/(1 + t^). The integral is therefore reduced to that of a rational func¬ 
tion of and can be evaluated by the methods given in the preceding 
sections. 

Similarly any rational function of sinh x and cosh x can be integrated 
by the substitution t — tanh for then sinh .v = 2f/(i — t“), cosh x = 
(i + and dtjdx — 2/(1 — t-). 

In particular cases there may be a short cut. For example, an ex¬ 
pression of the form J /(sin .v) . cos .v . dx can be integrated by the 
substitution X — sin x, for cos x — dXjdx and the integral becomes 

//w 

EXAMPLES 


dX. 


(1) f (sin ;e)® cos x . dx = J X^ dX 

= IX* I (sin .v)* + c. 

(2) J (sin .v)"(cos .y)® dx = J (sin .v)^[i — (sin .v)-] cos a: . dx 

= f X\i - X^-)dX 

- \X* - + C 

= J (sin .v)* — J (sin .v)® + C. 

Similarly any integral of the form J /(cos at) . sin a: . dx becomes 
— J J{X) dX on substituting X = cos X. An integral of the form 

J (sin A-)"'(cos .v)" dx, where m and n are positive integers or zero, can 
always be found by using the identities cos a: = i(e*® -}- e”*'), sin a: = 

(c*-* ^ e-*'),2L 
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EXAMPLES 


(i) J (cos xy dx = J \ +24- dx 


»2t X 


.«2< X 


= V + -- V-+C* 

82 2 8z 

= I sin 2X 4 - |.tr 4- C. 

(2) f (cos ;t)®(sin = f —-{•e-^^)-{e*^—e-^^Ydx 

= — -.V / {e^‘^—e-^'^y-dx 

— —T^e' f (e**^ — 2 +e-*^^)dx 


64/ ^ 8 ^ 64/ 


=- 4 - C. 


= — 3^sin4Jc4-iA:4-C 


If either ?« or n or both are negative integers a (possibly repeated) 
application of the reduction formulas (verifiable by differentiation) 

/ (sin je)"‘(cos x)'*dx 


_ m -|-«+2 I :»:)"*(cos xy^-dx — 

I (sin jc)"’+''(cos xYdx — ^ 


m-f« 4 - 


(sin Ar)"*'*‘*(cos a.*)"+^ 
n + i 

(sin x)'”+’(cos x)’*'*'^ 


m-\-i 


will reduce the integral to a simpler form. 

If a function contains only even powers of sin x and cos x it can 
be integrated by substituting X = tan a*, (sin at)** = X‘l{i 4- AT-), 
(cos xy =1/(1 4- X'^), dxfdX = 1/(1 4- X^). 


15.9 Integrals containing irrational functions 

A rational function of x and ^)> where A and D are 

constants, can always be integrated. Examples of such functions are 
X 4 - V{x 4 - 1), {x^ 4 - 3-v)/[i 4 - Vi^x — 3)] and indeed any expression 
which can be obtained from x and V{Ax 4 - B) by using only 
combinations of additions, subtractions, multiplications and divisions, 
including additions and multiplications by ordinary real or complex 
numbers. The substitution X = \/(^x + B) transforms the integral 
of any such function into the integral of a rational function of X. 

FF 
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EXAMPLES 

(i) Find J xy/{x + i) . dx. 

Put X = ■y/(x + i), then x = — i, and dxjdX = 2X. The 

integral therefore becomes 


f (X^-i)X . 2 XdX = f { 2 X^ - 2 X^) dX 

- -IX^ Jr C 


= (v 4 - T^ 5 /^ 


2 / 


I 


\-i/o I 


(2) Find J (2x + 2)1 V(x — 2) . dx. 

Put X = '\/(x — 2): then x = X- 2, and dxjdX = zX. Thus 

/ (2.V + 3)/V(‘V - 2). dx = J {zX^ H- 7) . zXjX . dX 

= iX^ 4- 14A: 

= -^{x — 2)3/2 4- 14 (.V — 2 )*/ 2 . 


More generally any rational function of x and 



can be 


integrated, where w is any positive integer. It is enough to substitute 

( Ax 4- B \' 

when the integral is transformed to the integral 

* 

of a rational function of X. 


In particular this implies that we can integrate any rational function 
of .V and ^ (.v- 4 - Bx 4- C). For .v^ 4- B.v 4- C can be factorized as, 
say, (.V — a)(.v — jS) ; and then we can write \/{x^ 4 - -Sa: 4 - C) = 



So by the substitution X — 



we can reduce 


the integral to one of a rational function of X. However, if a and /S 
are complex this substitution is rather awkward, and it is then better 
to proceed by “completing the square”, writing X‘ 4 - Bx 4- C = 

(.V 4 - \B)“ -I- (C — IB) — X^ + where X = x ^B and b = 

v (f^’ ~ i^)‘ 'Xc then substitute X = b cos d and reduce the integral 
to a trigonometric one of the forms considered in Sections 15. i and 15.8. 


15.10 Integrals containing the square root of a cubic or quartic 
polynomial 

In general an integral containing the square root of a third or fourth 
degree polynomial cannot be solved by the methods of the preceding 
section. Such integrals are called “elliptic integrals”, and require the 
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introduction of new functions, called “elliptic functions”, which are 
very similar to the ordinary trigonometric functions in many ways. 
Unfortunately to present the theory in its simplest and most lucid form 
involves delving rather more deeply into the theory of complex functions 
than is possible here, and the reader is referred to text-books on the 
subject. A particularly simple and elegant account will be found in 
Jacobian Elliptic Functions, by E. H. Neville (2nd edn., 1951, O.U.P.) 
and a useful collection of formulas and tables in Jacobian Elliptic Function 
Tables, by L. M. Milne-Thomson (Dover Publications, 1950). 

Integrals involving roots of polynomials of degree 5 or more are 
best dealt with by numerical methods (Section 11.16). 



CHAPTER SIXTEEN 


PHYSICAL AND CHEMICAL MAGNITUDES 


i6.i Time 

There are two natural units of time, the day and the year. 

The day, or more accurately the “mean solar day”, is measured as 
the average interval of time between successive times at which the sun 
is due south of any given point. It is perhaps worth pointing out that 
this is not the same as the time of rotation of the earth, or “sidereal 
day”: for in consequence of the earth’s revolution round the sun the 
earth has to do more than a complete revolution to catch up with the 
sun. But it is the ordinary solar day which concerns us here since it 
corresponds to the regular alternation of darkness and light; the sidereal 
day is of interest only to astronomers. For convenience the day is 
divided into 24 hours, the hours into 60 minutes, and the minutes into 
60 seconds each, in a way which is both familiar and universal. That 
is, I day ~ 86,400 seconds. 

The year, or more exactly the “tropical year”, is defined as the 
average interval between successive spring equinoxes (times when the 
plane of the equator passes through the sun). Again this is not quite 
identical with the time of revolution of the earth round the sun, or 
“sidereal year”, because the plane of the equator is slowly changing 
its direction. That phenomenon is known as “precession”. Never¬ 
theless it is the tropical year which is of the greatest interest, since it 
corresponds to the regular recurrence of the four seasons. It has a 
length of 365-2422 days. 

In practice we replace the exact tropical year by a “civil year” of 
365 or 366 days. The alternation of leap and non-leap years is so 
arranged by the Gregorian calendar that the average length of the civil 
year over a long period is 365 2425 days; this differs quite inappreciably 
from the length 365-2422 days of the exact tropical year. 

Besides these units we have the conventional ones of the week = 

7 days, and the month, originally no doubt derived from the moon, 
but now a conventional and irregular unit. This leads to such problems 
as to find the number of days between two given dates, such as Sept. 30, 
1949, and June 21, 1950. Such problems are easily dealt with as fol¬ 
lows. First consider a non-leap year. Instead of numbering the days 
in the usual way, we shall simply number them consecutively, beginning 
with January i as day i, and ending with December 31 as day 365. 
’Thus Jan. 26, 1949, will be day 26 of year 1949, and can be written 
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simply as 49D26; March 3, 1950, will be day 62, or 50D62 in brief. 
Any date can be converted by the use of the following table. 


Table i6.x—Month numbers 


January (non-leap) 

0 

June 

^51 

y y 

(leap) 

— I 

July 

181 

February 

(non-leap) 

31 1 

August 

212 

) 1 

(leap) 

30 

September 

243 

March 


59 

October 

273 

April 


90 

November 

304 

May 


120 

December 

334 


To find the day number, add the day of the month to the “month 
number” given in this table: e.g. Sept. 30 = day (243 + 30) = day 273. 
June 21 = day (151 -{- 21) = day 172. To find the time between any 
two dates in the same year it is then only necessary to subtract the 
smaller day number from the larger: e.g. from noon on June 21 to 
noon on Sept. 30 is 273 — 172 = loi days. 

To find the number of days between two dates in different years 
requires two new definitions. We shall restrict ourselves for simplicity 
to the 20th century: the “year number” of a given date will then be 
the last 2 figures of the year, omitting the prefi.K 19: so 1949 has year 
number^ = 49, and 1950 has year numbcry - = 50. 'rhe date June 21, 

1949 = 49D172 has year number y ~ 49 and day number d — 172; 

Sept. 30, 1950 = 50D273, with y = ^o, d 273. We also introduce 
the “quarter year number” q, which is the quotient obtained by dividing 
the year number by 4, ignoring the remainder. Thus for 1917, y' = 17, 
^ = 4; for 1949, y — 49, q 12. 'I'he number of days between day 
d^ of year jy, (quarter year ^,) and day d.^^ of year v . (quarter year <7.,) 
is then 

« 365 0'2 — J'O t ('/a — 91) + (^/2 — . . (r6.i) 

For example, from noon on June 21, 1949 - 49D172 to noon on 

Sept. 30, 1950 = 50D273 is 

365 (50 — 49) + (12 — 12) -f (273 — 172) -- 466 days; 
from Sept. 30, 1946 = 46D273 to June 21, 1949 49D172 is 

365 (49 - 46) H- (12 - ii) + (172 — 273) - 995 days. 

These numbers are also useful in finding the day of the week for 
any given date. The rule is to divide the sum {y q -f- d) of the year, 
quarter year, and day numbers by 7; the remainder then determines 
the day as in Table 16.2. 
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Table i6.z—Determination of the day of the week 


Remainder 

Day 

Remainder 

Day 

0 

Sunday 

4 

Thursday 

I 

Monday 

5 

Friday 

2 

Tuesday 

6 

Saturday 

3 

Wednesday j 

1 




This is equivalent to the continental numbering in which the week 
begins on Monday. Thus July 3, 1951 = 510184 by Table 16.1. 

y + q + d = iz 184 = 247. On division by 7 this gives 
remainder 2, showing that it is a Tuesday. 

For leap years the scheme is slightly modified by counting Jan. i 
as day o instead of day i, Jan. 2 as day i, and so on consecutively to 
Dec. 31 — day 365. This has two advantages. Firstly, the month 
numbers have to be adjusted only for the months of January and 
February, as shown in Fable 16.i. So any date after February will 
have the same day number whether the year is leap or not: Christmas 
day will be day 359 in any year. Secondly, the rules for finding the 
number of days between two given dates and for finding the day of 
the week, apply without any change. If we had counted Jan. i as 
"day i" in a leap year these rules would have had to be modified. 

EXAMPLES 

(1) Find the number of days from June 13, 1948, to Feb. 2, 1952. 

From Table 16.1 these dates are 48D164 and 52D32. The number 

of days is u = 365 (52 — 48) -f- (13 — 12) + (32 — 164) = 1329. 

(2) What day of the week will Feb. 4, i960, be.^ 60 + 15 + 34 = 
109: this gives remainder 4 on division by 7, i.e. this will be Thursday. 

16.2 Length 

For measures other than those of time there are, of course, two 
competing systems of units, the metric system, used internationally and 
almost universal in scientific work, and the British system. 

'Fhc metric system has the advantage of having systematic sub¬ 
divisions of its units by powers of 10, according to the prefixes deci- 
(d) = io“‘, centi- (c) = lO”®, milli- (m) = lO”^ micro- (^) = io“®, 
nano- (n) lo"®, pico- (p) = lo**-. Also we have the multiples 

Dcka- (D) = 10, Hecto- (h) = lo^, Kilo- (K) = 10^ and Mega (M) 

- - 10*. 

In contrast to this the British system has irregular subdivisions, 
such as 12 inches to i foot, 3 feet to i yard, and in addition the units 
employed in Britain are slightly different from the American ones. For 
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these reasons it seems doomed to eventual extinction, in spite of certain 
advantages over the metric system. But for various reasons medical 
and biometrical data are still often collected in British units, and while 
that is so it is useful to have conversion factors. 

The standard British inch is 2*5399956 centimetres. The American 
inch is defined by the relation i metre = 39 37 inches; that is, i inch = 
2*5400051 cm. For all practical purposes both these definitions are 
equivalent to i inch = 2*54 cm. Further measures in common use are 

I mil = lO"^ inches. 

I foot =12 inches. 

I yard = 3 feet = 36 inches. 

I mile = 8 furlongs = 1760 yards = 5280 feet = 63,360 inches. 

I fathom = 2 yards = 72 inches. 


The principal conversion factors are therefore as follows (the com¬ 
mon logarithms arc also given): 


Logarithms 


I inch = 2*54 cm ^ 40483 

I foot = *30480 m 1 48402 

I yard = *91440111 1-96114 

I mile = 1*6093 km *20665 

I metre = 39 370 inch ^' 595^7 

— 3*2808 feet 5*598 

= 10936 yards *03886 

I km = *62137 miles ^79335 


\ 


16.3 Area 

This is simply the product of two lengths, and is accordingly 
measured in square inches, square centimetres, square metres, etc. 
The metric are = 100 square metres. 'I'he British system has the 

additional units i acre = 4 roods = 4840 square yards = square 

mile. Thus the principal conversion factors are: 


I yd 2 

= *83613 m* 

I ft 2 

= *092903 m* 

I in^ 

= 6-4516 cm* 

I m* 

= 1*1960 yd* 


^ - 10-764 ft* 

I cm* 

= *15500 in* 

I mile* 

= 2*5900 km* 

I km* 

= *38610 mile 


Logarithms 

T 92227 
2*96803 
•80967 
■07773 

2*03197 

1-19033 

•4*330 

I *58670 
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16.4 Volume 

A volume is measured as the product of length, breadth, and height, 
and will therefore be most naturally measured in cubic inches, cubic 
centimetres (cc), or cubic metres. The cubic metre is sometimes called 
the “stere”. 

The metric system has a second unit of volume, the litre, defined 
as the volume of i kilogram of distilled water at 4°C and 760 mm of 
mercury pressure. The exact relation is i litre = 1000*028 cc, or 
I cc = *000999972 litres. For most purposes, therefore, a litre is 
equivalent to 1000 cc. 

The British system has a whole range of extra units, based on the 
imperial gallon, which is defined as the volume of 10 pounds avoirdupois 
of distilled water, weighed in air against brass weights, at 62° F and 
30 inches of mercurj' pressure. The derived units, still used in medicine, 
are related thus: 

60 minims = i fluid drachm 

8 fluid drachms = i fluid ounce 

20 fluid ounces = 4 gills = i pint 

2 pints = I quart 

4 quarts = 8 pints = 160 fluid ounces = i gallon. 

The following are the conversion factors: 

Logarithnu 


I pint = *56826 dm® 1*75455 

I gallon = 4 5461 dm® *65764 

I in® = 16*387 cc 1*21450 

I ft® 28*317 dm® 1*45205 

I dm" = -035315 ft" 2 - 5479 S 

I CC = *061024 2*78550 

I gallon = *16054 ft® i’'20558 

I ft® = 6*2288 gallons '79442 


In the U.S.A. the gallon is defined as 231 cubic inches, i.e. as 
3-7854 dm®. 

The above relations will hold for litres in place of dm®, except that 
the last significant figure may be slightly changed. 

16.5 Mass 

The metric unit of mass is the kilogram, i gram = *001 kg, 
I tonne = 1000 kg. 

The usual British unit is the pound avoirdupois = *45359243 kg. 

This is divided thus: i dram = lb.; 1 ounce (av.) = — lb; i stone 

256 ^ ^ i6 ’ 

= 14 lb.; I quarter =- 28 lb.; r hundredweight = 112 lb.; i ton = 
2240 lb. In America the “short hundredweight*’ = loo lb. and “short 
ton” = 2000 lb are also in use. 
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There is also a second system of weights, the “Troy” and “Apothe¬ 
cary’s** system, used in medicine. The basic unit is the grain =_i- 

lb avoirdupois. The other units are the scruple = 20 grains, the drachm 
= 60 grains, the pennyweight = 24 grains, the Troy ounce = 480 
grains, and the Troy pound (rarely used) = 5760 grains = 12 Troy 
ountes. The principal conversion factors are: 



Logarithms 

I grain = 064799 8^ 

2-81157 

I drachm = 3*8879 gm 

■58972 

I troy ounce =; 31*104 gm 

1*49282 

I av. ounce = 28*350 gm 

1-45255 

I av. pound = 45359 

I *65667 

I stone = 6 3503 kg 

*80279 

I ton = I 0161 tonne 

*00691 

I gm = 035274 oz av. 

2-54745 

I kg = 2*2046 lb av. 

•34333 

I tonne = *98421 ton 

1*99309 


x6.6 Velocity 

Velocity is measured by distance divided by time. The natural 
units are accordingly feet per second, miles per hour, metres per second 
and kilometres per hour. 


I ft/sec = m.p.h. = *68182 m.p.h. 

22 ^ ^ 

I^ogarithms 

1-83367 

I m.p.h. — — ft/sec = 1*4667 ft/sec 

•16633 

I m/sec = 3*6 km/hour 

■55630 

I km/hour = m/sec = *27778 m/sec 

1 0 

1*44370 

I ft/sec = *3048 m/scc 

1*48402 

I m/sec = 3*2808 ft/sec 

•5*598 

>6.7 Derived units: the M.K.S. and C.G.S. systems 



From now on we deal with more complicated quantities, such as 

forces, pressures, etc.; and fortunately the units are con¬ 
siderably simplified. For it is usual to express such quantities in a 
consistent system of units, derived from a single unit of time, a single 
unit of mass, and a single unit of length. 

In all systems the unit of time is the second. The standard units 
o mass and length are the kilogram and metre respectively, and the 
erived units form the metre-kilogram-second or M.K.S. or Giorgi 
system. Sometimes the gram is chosen as the unit of mass and the 
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centimetre as unit of length, and we obtain the centimetre-gram-second 
or C.G.S. system. The M.K.S. system has the advantage that it 
includes the common electrical units, such as the volt, the ampere, and 
the watt. The C.G.S. system does not agree with these electrical units, 
which in itself is inconvenient, and moreover it gives units which are 
of rather awkward sizes for most purposes, e.g. the “erg” or energy 
unit is too small to be convenient. Fortunately the two systems differ 
only by powers of 10, and so are readily interconverted by a shift in 
the decimal point. Thus a velocity of i cm/sec = -01 m/sec, and an 
acceleration of i cm/sec* = *01 m/sec®. 

16.8 Change of units 

Almost always we measure a velocity by dividing the distance 
covered by the time: and to express this division we write the unit of 
velocity symbolically as the quotient, “unit of distance/unit of time , 
as “metres/sec” or “miles/hour”. Such a choice is very natural, and 
reduces the arithmetic to a simple division. But it is not inevitable. 
It would be possible to measure distances in feet, time in seconds, 
and velocity in miles per hour. In that case, a distance y feet covered 

in a time t seconds corresponds to a velocity oi v ==■ ~ yjt miles per 

hour. Such a choice of units means that the relation v = y!t no longer 
holds; but for some very special purposes, such as the timing of cars 
or trains, it might be more convenient than the usual choice of feet 
per second. A similar situation could arise in metric units if we measured 
the distance v in metres, the time t in seconds, and the velocity v in 
kilometres per hour. The relation would then be f = 3’6y//. We can 
say that when we take i metre as the unit of distance, and i second 
as the unit of time, then i metre/sec is a “derived” unit of velocity, 
or to use an alternative phrase, a “germane” unit. It can be defined 
as that unit which simplifies a particular formula. Thus if a length 
L. is measured in metres, the natural derived unit of volume V is the 
cubic metre or stere, since the formula for the volume of a cube is 
then V = L^. W'e could if we liked choose another unit as the unit of 
volume; for example we could take the volume of a sphere of radius i, 
which might be called a “spherical metre”. The volume V of a sphere 
of radius R would then be V = spherical metres, instead of V = 
tirR^ cubic metres. But here experience shows that the cubic rnetre 
is a much more useful unit than the spherical metre—not by divine 
command, but simply because we find it so. If natural bodies were 
commonly of spherical shape it might be other\vise. Both the cubic 
and spherical metres could be considered as derived units, but here 
experience shows us which one to take. 

Sometimes it may be convenient to change the principal units: it 
is then important to know what happens to the derived units. Suppose 
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for instance that a velocity has been estimated to be 8 yards/minute, 
and we wish to change the units to inches and seconds. What then will 
the velocity be expressed in inches/sec? 

Now a velocity of 8 yards/minute means one in which 8 yards = 
8 X 36 inches are covered in i minute = 60 seconds. Since the 
velocity is calculated by dividing the distance gone by the time taken 
—for that is the meaning of our derived units—this velocity will be 

—inch/sec = 4 8 inch/sec. But we can also obtain this answer by 

treating the symbolical division “yard/min.” as if it was an actual 
division: 


o j / • 8 yards 8 x 36 inches o • l 

o yards/min. = ——. --- = -,-= 4-8 inch. 

I min. 60 sec 


sec. 


Similarly a volume of 1 metre® means the volume of a cube of side 
I metre, i.e. 100 cm, and so by the formula V = L®, where V is in cc 
and L. in cm, this is 100® cm®. But we could also get this result by 
writing (i metre)® = (100 cm)® = 100® cm®, treating the symbol “cm” 
exactly like any ordinary algebraic symbol. This pleasing property 
holds throughout the whole range of derived units, and makes their 
interconversion an easy matter. 


16.9 Force, energy and power 

In the metre-kilogram-second system the unit of force is the newton, 
defined as the force needed to give a kilogram mass an acceleration of 
I metre/sec®. In general a force / newtons acting on a mass m kilo¬ 
grams produces an acceleration Dt^y metres/sec® according to the law 
‘‘force = mass X acceleration” 

f = m D^y .... (16.2) 

^^y> the second derivative with respect to the time, is often denoted 
by y (read as “y double dot”). Similarly the first derivative /),v of 
the position y, or in ordinary language the velocity, is often written 
as y (read, “y dot”). 

In the e.G.S. system the unit of force is the dyne, which gives a 
mass of I gram an acceleration of i cm/sec®. In the British (foot- 
pound-second) system the corresponding unit is the poundal, which 
gives one pound an acceleration of i ft/sec®: but this is very rarely 
used in practice. The conversion factors arc 

I dyne = lO”® newtons 
I poundal = *13825 newtons 

The work done by a force in moving an object is measured by the 
product of the distance gone times the component of the force in the 
direction of motion. The M.K.S. unit of work is the joule — newton 
X metre = kg . m®/sec®. The C.G.S, unit is the erg =: dyne x cm = 
gm . cm®/sec®. Thus i erg = lo-® joule. 
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centimetre as unit of length, and we obtain the centimetre-gram-second 
or C.G.S. system. The M.K.S. system has the advantage that it 
includes the common electrical units, such as the volt, the ampere, and 
the watt. The C.G.S. system does not agree with these electrical units, 
which in itself is inconvenient, and moreover it gives units which are 
of rather awkward sizes for most purposes, e.g. the “erg” or energy 
unit is too small to be convenient. Fortunately the two systems differ 
only by powers of lo, and so are readily interconverted by a shift in 
the decimal point. Thus a velocity of i cm/sec = -oi m/sec, and an 
acceleration of i cm/sec** = -oi m/sec®. 

i6.8 Change of units 

Almost always we measure a velocity by dividing the distance 
covered by the time: and to express this division we write the unit of 
velocity symbolically as the quotient, “unit of distance/unit of time , 
as “metres/sec” or “miles/hour”. Such a choice is very natural, and 
reduces the arithmetic to a simple division. But it is not inevitable. 
It would be possible to measure distances in feet, time in seconds, 
and velocity in miles per hour. In that case, a distance y feet covered 

in a time / seconds corresponds to a velocity ^ ^ miles per 

hour. Such a choice of units means that the relation v — longer 

holds: but for some very special purposes, such as the timing of cars 
or trains, it might be more convenient than the usual choice of feet 
per second. A similar situation could arise in metric units if we measured 
the distance v in metres, the time t in seconds, and the velocity v in 
kilometres per hour. The relation would then be 

sav that when we take i metre as the unit of distance, and i second 
as the unit of time, then i metre/sec is a "derived” unit of velocity, 
or to use an alternative phrase, a "germane” unit. It can be defined 
as that unit which simplifies a particular formula. Thus if a length 
L is measured in metres, the natural derived unit of volume V is the 
cubic metre or stere, since the formula for the volume of a cube is 
then V = L^. We could if we liked choose another unit as the unit of 
volume; for example we could take the volume of a sphere of radius i, 
which might be called a "spherical metre”. The volume V of a sphere 
of radius R would then be T spherical metres, instead of V = 

%7tR^ cubic metres. But here experience shows that the cubic inetre 
is a much more useful unit than the spherical metre—not by divine 
command, but simply because we find it so. If natural bodies were 
commonly of spherical shape it might be otherwise. Both the cubic 
and spherical metres could be considered as derived units, but here 
experience shows us which one to take. 

Sometimes it may be convenient to change the principal units: it 
is then important to know what happens to the derived units. Suppose 
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shall call it SF instead of F, meaning “a little bit of volume”. Then 
the mass 8m contained within it will also become small, and common 
experience suggests that the density will be practically uniform through¬ 
out 8F. In calling SF “small” we mean that it must be small in length, 
in breadth, and in height: a very long thin body could have a very small 
volume but vary considerably in its density. One can further imagine 
this element 8F of volume shrinking ever smaller and smaller, so that 
the ratio Sm/SF or average density tends to a definite limit p which 
can be called the “density at the point” and denoted by dmldV. Strictly 
speaking this is nonsense, for below a certain size we should notice the 
presence of atoms, and the density will not be uniform. But it is a 
convenient idealization to speak as if the limit did exist, and will not 
do any harm provided that its limitations are clearly recognized. 

Actually the density so defined is only a special kind of density. 
If the mass is concentrated along a line whose thickness can be neglected, 
such as a thin wire, we can speak of the “line density” A measured in 
kilograms per metre, or grams per centimetre. A will be the limit 
of the ratio hmfhL = mass/length for a small length 8L of the wire, 
as hL o. If the mass is concentrated on a surface it will have a 
surface density” a kg/m*, measured as mass per unit area. We can 
also speak of densities of other quantities than mass. An electric charge 
can have a line, surface, or volume density; a surface density of a force 
is called a “pressure”, and so on. 

i 6.12 Multiple integrals 

Imagine a curve C(Fig. i6. i) on which the line-density of a substance 
A (A need not be constant, but can vary from one point of the curve 



Fig. z6.x—Single and double integrals 


to another.) If then we take a small length SL of the curve, the mass 
^fn contained in this will be approximately A8L, and therefore the total 
mass on the curve will be approximately A/ = T Srn £ X 8L. If we 
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suppose these small lengths hL to be cut up into still smaller lengths, 
the formula will approximate even more closely to the total mass: and 
in the limit, as all the hL tend to zero, we shall get the exact formula 

M = f ^ A dL. This is an ordinary definite integral: the suffix C 

means that the integral is taken along the curve, with the two ends of 
the curve as limits of integration. 

Now suppose we have a surface S on which the surface density is 
<s (Fig. i6.i) (a may vary from point to point of the surface). Let us 
cut this surface up into small areas BA; that is, these areas must be 
small both in length and breadth. Then the mass contained in the area 
BA is approximately Bm 2:: a . BA, and the total mass is M = Bm 2: 
2 * *T . BA. A division of these areas into still smaller ones will improve 
the approximation: and as all the small bits of area BA tend to zero 
the sum 2 * a . BA will tend to the total mass Af as a limit. Now this 
is analogous to an ordinary definite integral, but is of a new kind. It 

is called the “double” or “surface” integral o . dA. 

In the same way the mass of a three-dimensional body B of volume- 
density p (possibly varying from point to point) can be found by cutting 
it up into small elements of volume 8F. Each of these will contain a 
mass Bfn ~ p 5 F, and the total mass AI will be approximated to by 
the sum S p BV which will tend to the limit M as all the volume ele¬ 
ments BV tend to zero. This limit is called the “triple” or “volume” 


integral M = y^ p dV, 

If the surface S is plane there is a simple interpretation of the double 
integral f . g dA. We can represent c graphically by taking an axis 
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perpendicular to S as an axis of a; the value of <j at any point P on the 
surface will then be plotted as a point P' at height a = PP' above P. 
These points P' will lie on a curved surface S' (Fig. 16.2). The integral 

CT dA will then be equal to the volume between the surfaces S and 
S'. (This is analogous to the representation of the ordinary integral 
f y dx zs the area between the x»axis and the curve for y.) For if we 


take any small element of area SA in S and erect above it a column of 
height a reaching up to the surface S', the volume contained in this 
column will be approximately area of base times height, i.e. a BA. The 
total volume contained in all such columns will be 2 * BA, approxi¬ 


mately: and in the limit this sum will become J' a dA by definition. 

This in effect replaces a surface distribution of variable density a by 
a volume distribution of uniform density i but variable height o; and 
naturally this gives the same integral. 


*6.13 Repeated integrals 

How in practice can we calculate such a double or triple integral? 
One method is the following. Consider for the sake of simplicity a 

double integral J dA taken over a plane area S. In this plane we 

can introduce co-ordinates x and y. Instead of using a division into 
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small areas SA of arbitrary shapes, we divide the surface into small 
rectangles by two parallel systems of thin strips, one parallel to the 
y-axis, the other system parallel to the jf-axis. A strip of width Sx 
parallel to the y-axis will intersect one of width 8y parallel to the x-axis 
in a small rectangle of area SA = Sx . Sy (Fig. 16.3). The total mass 

M = J'dA will therefore be approximated by the sum Z Sm = 
Z (o S^) — Z {^G . 8:1: . 8y). 

Now concentrate attention on a single strip of width Sx parallel 
to the y-axis. We shall suppose that it lies between the x-co-ordinate 
values X and x + S.v, and that the lowest point on this strip has y- 
co-ordinate Y^, and the highest point has y-co-ordinate 5^2* with all 
intermediate points belonging to the strip. (That is, we take S to be 
of a fairly simple shape, with the property that each vertical line cuts 
it in one range only. A shape like the crescent moon, which can be cut 
twice by a single vertical line, will not be covered by our argument: but 
as a rule it is not difficult to make suitable allowance for this complication 
when it occurs in practice.) 

We notice that K, and are functions of x which depend on the 
shape of the boundary curve. 

Now the total mass contained within the vertical strip is approxi¬ 
mately Z a SA = Z G Sx . Sy, summed for all elements of area con¬ 
tained in this strip. But here Sx is constant, and we can write this 
sum as Sx . Z G Sy, summed over all elements of the strip, i.e. between 
y = V’, and y = Y2^ Now this sum is the ordinary approximation 
to a definite integral: so the total mass in the strip is approximately 

Sx . cdy 

where the integration is performed with respect to y only, keeping x 
fixed at the value x. To find the total mass we now have to add all these 
strips together: that is 

M 2 ^. Z dy^ Sa:. 

This summation is performed over all possible values of x, say from 
A*i to A*2, where x^ is the least value of a* and Xz the greatest (Fig. 16.3). 
In the limit when we take both the Sa and the 8y strips to tend to zero 
in thickness this sum will again become a definite integral, and we shall 
expect exact equality: 

M = ^dA = (/^’ ady)dx . (16.3) 

The right-hand side here is called a “repeated integral” and is 
usually written J * J^ dy dx or J Jdy dxy without the brackets. 

(Some text-books write it as J J a dx dy\ which does not seem 
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SO logical a notation.) The interpretation is that the inside integral 
is to be performed with respect to_y, keeping x constant; and the limits 
YI and Y g are the boundary values of y for the particular value of x. 

EXAMPLE 

(i) Find the mass contained within the right-angled triangle S with 
vertices at O, (i, o), and (i, i), given that the surface density g at the 
point {x, >') is G = X + j (Fig, 16.4). 



Fig. 16.4—The mass of a triangle of varying density 


Here Kj = o, Y2 = x, and Xi = o, X2 = i. The mass is therefore 


f GdA 

J s 


= /o (/o 

= J\ + b'-Jo 

= (v^ + I v^) dx 

= [i - i = i. 


Similarly a volume integral M = J can be expressed as a 

repeated integral 

M=^ f^pdzdydx . . (.6.4) 

with the following interpretation: x, y, and z are rectangular cartesian 
co-ordinates. We begin by integrating p with respect to Zy keeping 
X and constant. The limits of integration for p are and Z,, the 
least and greatest values of z for the particular values of x and y\ i.e. 
^1 and Zj are functions of x and y. This integral is now in turn 
integrated with respect to ^ only between the limits Fj and Y2, which 
are the least and greatest values of y for a given value of .v. 'I'he final 
integration is with respect to x. 

CO 
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Sometimes (Fig. 16.5) it is useful to take polar rather than cartesian 
co-ordinates in evduating a surface integral. Imagine the area divided 



Fig. 16.5—A double integral in polar co-ordinates 


Up into thin circular strips of width Sr and thin radial wedges of angle 
S 0 . These intersect in small areas 8 A which are nearly rectangular. 
The radial width of such a rectangle is Sr; the perpendicular width is 
r S 0 , by the formula for the length of a circular arc. The area 8 A is 
therefore approximately r 80 8 ry the mass contained within it is approx¬ 
imately or 80 Sr, and the total mass within S is approximately 
£ a r 80 Sr. In the limit, as the thickness Sr of the circular strips 
and the angle S0 of the radial wedges both tend to zero, this will become 

a repeated integral; either J'J dO dr y if we integrate first with 

respect to and afterwards with respect to r, or f f dr d0 if the 

first integration is with respect to r. Thus 

= f = f f dd dr = f f ^ar dr dd (16.5) 

In each case the integrals are to be taken between the appropriate 
limits, '^rhus if we begin by integration of cr with respect to 0 , keeping 
r constant, the limits of integration will be the e.xtreme values of 0 
possible for that value of r. The second integration will then be over all 
possible values of r. Conversely, if we begin by integrating with 
respect to r, keeping 0 fixed, the range of integration will be over all 
possible values of r for that value of 0 . 


FURTHER EXAMPLE 

(2) Find the mass contained within a circle of radius i, given that 
the surface density is proportional to the distance from the centre, 
i.e. G = ivr. 
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Here M= f f ^Kr- dB dr. The first integration will be with 
respect to 6 : here the polar angle $ will range from o to ztt to cover 
the whole circle, so that Kr~ dd = zirKr-. This has now to be 
integrated over the range of values of r, that is, from o to i; A/ = 
fl dr = = lr,K. 

16.14 Centres of gravity 

Suppose we have a system of small particles of masses w,, 

7 Tt 3 ... . tri„, with total mass M = m.. = £ rtia. 

Suppose further that we define the “moment” of any particle about 
a given plane 77 to be the product of the mass of the particle times 
the perpendicular distance from 17 . Then it is shown in text-books 
on mechanics that there is a point G such that the total moment of the 
whole system about FI is equal to the moment of a single particle of 
mass M placed at G: and this is true whatever plane FI we take. G is 
then known as the “centre of mass” or “centre of gravity” of the system. 
In particular the moment about any plane through G is zero; it follows 
that if the system is rigid it will exactly balance if supported at the 
centre of gravity. 

Consider first a distribution of particles in a plane. Let (av, yv) be 
the co-ordinates of the particle w.. (Fig. 16.6). Then the position of 



Fig. 16.6—The centre of gravity G of a system of masses in a plane 

G can be found in the following way. We shall call the co-ordinates 
of G (A.y); tins is the usual notation—here the bar over the letter 
means “centre of gravity” and has no connection with the former use 
of a bar to denote the complex conjugate. Now if we take moments 
about a plane through the origin O perpendicular to the A-axis then 
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the moment of the mass is since Xf is its distance from the 

plane. It follows that the total moment of all the particles is S m^jpCa. 
This must be equal to the moment Mx of the hypothetical particle 
of mass M placed at the centre of gravity, whence 

Mx = S triaXa 

Similarly taking moments about a plane through O perpendicular to 
the y-axis gives My = S may<i> G is therefore the point with co-ordin¬ 
ates (x, y) given by 

X = 2 ntaXalM, y = ^ ntayalM. 

For a set of points in three dimensions we have similarly a centre of 
gravity G with co-ordinates (: 5 , y, z) where 

X = 2 ?naXalM; y = 2 may’alM ; z = 2 ntaZajM . (16.6) 

Alternatively we can regard equations (16.6) as the definition of the 
centre of gravity. 

One point perhaps calls for comment. The equation for x can be 
written in the form 

- = WiA’i + m^Xj + + . . . + m„x„ . ^ , 

OTi + W2 + m3 + . . . + m„ ‘ ^ 

Now if all the masses mj, . . . m„ are equal this means that .v is 
the ordinary average (a;i -f-Xg + . . -h of the co-ordinates Xi^ 

X2, . . . x„. If the m^’s are unequal we therefore call x the weighted 
average or weighted mean of a*,, Xg . . . with weights mj, mg, . . m„ 
respectively. 


PROBLEMS 

(1) A system consists of three particles. One is at (o, o) and has 
mass I, one is at (i, i) and has mass 2, and one is at (3, o) and has mass 
3. Where is the centre of gravity ? 

(2) A system consists of a mass i at (o, o, o), a mass 2 at (i, i, i), 
a mass 5 at (2, i, o) and a mass 2 at (i, 2, i). Where is the centre of 
gravity ? 


Now consider a plane area S over which the mass is continuously 
distributed with surface density c, instead of being concentrated in a 
few points. Then each small element of area SA will contain a mass 
c 8 A ; and in the definition of the centre of gravity according to equa¬ 
tion (16.6), instead of writing x = 2 maXa/M, we shall have x^ 
2 (7X 8 A/M, where “a” is the x-co-ordinate of the element SAy and 
the summation is over all such elements. If we divide the area into 
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smaller and smaller pieces BA, whose maximum size tends to zero, 
this sum will tend to the limit 

x = M-^f^axdA . . . (i6.8) 

and similarly y ~ f ^ <jy dA. 

These formulas will give the position of the centre of gravity for 
any plane sheet. For a three-dimensional body B of density p the 
discrete formulas (i6.6) will be similarly replaced by 

X = f px dV : y = M“* f py dV ; 

z = f pz dV . . . (16.9) 

As a rule these integrals can be found most easily by turning them 
into repeated integrals. 

EXAMPLES 

(1) What is the centre of gravity of the triangle of Fig. 16.4 which 
has surface density c = x y} 

As we have already shown, the mass M — J g dA = So 
X = M-^ J GX dA = z (.V y) X dy dx 

~ ^ dx 

= I = I 

y = M-^ f GydA = z fl /J(.v -\-y)ydydx 

^ ^ fl [^xy^ + 

= 2 /* dx 

= A [x*U = 1% 

The C.G, is therefore at (|, -/a). 

(2) Taking the same triangle, where is the centre of gravity if the 
surface density is uniform, say c = i ? 

The mass is now 

M = f cdA = f\fl I . dy dx 

= /* xdx = i. 
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(This also follows from the formula: mass = surface density X area 
= I X ^ — i-) 

X = M-^ J ax dA = 2 J ^x dy dx 

= 2 /’ [xy]% dx 
= 2 x^ dx = § 

y = M~^ J ay dA = 2 J*y dy dx 

= 2 /* 

= /’ -v^ dx = \ 


The centre of gravity is therefore at the point (§, J). This is also 
the centroid, or intersection of the three medians of the triangle—a 
property which can be shown to be true in general for any triangle of 
uniform density. 


16.15 Moments of inertia 

The “moment of inertia” (or “second moment”) of a mass m about 
a line L is defined as the product of m times the square of the distance 
of m from L. For a system of masses the moment of inertia about L 
is the sum of the moments of inertia of the separate masses, i.e. if //,. 
is the distance of the mass from the line L, 

/= moment of inertia = 2* Wa/ia* • . (16.lo) 


If instead of discrete particles we have a continuous distribution 
of mass over an area Sy then in the above formula the point mass 
must be replaced by a SA, the mass of the element of area S* 4 ; and 
will be replaced by h, the distance of this element from the line L. 


The sum Z fnjia" becomes Z ah- SA, or in the limit I = f a/t^ dA. 
Similarly if the mass is continuously distributed over a volume B with 


density p, the moment of inertia will be J‘ ph- dV. If a body is rotated 


about an axis L with angular velocity to radians per second, then it can 
he shown that its kinetic energy is and its angular momentum Iw, 
where I is the moment of inertia about the axis. (Compare with the 
formulas hnv- == kinetic energy' and mv = momentum for a mass m 
moving with velocity t*.) Also if the body is acted upon by a set of 
forces with total moment ft about the axis L, the angular acceleration 
diujdt is given by' p. — Idmldt. 

As a particular case of our formula the moment of inertia of a plane 
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sheet S about the jc-axis is ^ ay- dA, and that about the ^’-axis 

is ax^ dA. 


EXAMPLE 

(i) To find the moment of inertia of a square of side L and surface 
density a about an axis through the centre parallel to one of the sides. 

Take the centre of the square as origin of co-ordinates, with axes 
parallel to the sides. Then x varies from —to and y from —AL 
to ^L, The moment about the A;-axis is therefore 

1 = f s'^y^dA = o;j.= dy dx 

= I'j oZ,’ dx — I'.T a L'. 

For a three-dimensional body the moment of inertia about the .v-axis 
is similarly given by 


16.16 Pressure 

By the “pressure” on a surface is meant the force acting perpendicu¬ 
lar to the surface per unit area, i.c. the surface density of the per¬ 
pendicular component. The surface density of the component of force 
tangential to the surface is called the “shear”. 

The M.K.S. unit of pressure is therefore the newton/m-, and the 
e.G.S. unit the dyne/cm^ = 1*^ nemon/rn^. A unit which is widely 
used is the bar which = 10'^ newton/m® = 10® dyne/cm®. i millibar == 
To’oTJ bar. 

In practice there are a number of other units of pressure some¬ 
times used: the principal one is the atmosphere, defined as the pressure 
of 76 cm of mercury at C. Unfortunately this depends on the value 
chosen for^: as a rule (but not invariably )g is taken to have the standard 
value 9 80665 m/sec*: and since the density of mercury at o® C is 
*3595 09 kg/m® we have the conversions: 

Logarithms 


I cm mercury = 013332 bar 212490 

I atmosphere = 1013250 bar 00572 

I kg wt/cm® = 98066 bar 1-99152 

I lb wt/in® = -068948 bar 2-83852 


A body subjected to an atmospheric pressure of 1 standard atmo¬ 
sphere, i.e. 1-013250 bar, and at o® C, is said to be at “normal temnera- 
ture and pressure” (N.'I’.P.). ^ 
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16.17 Viscosity 

When different parts of a fluid are moving with different velocities 
a stress is set up owing to the viscosity of the fluid. If the velocity 
becomes too great the motion breaks up into eddies, and becomes 
mathematically very intractable. But if the motion is slow and steady 
it can be calculated in the following way. 

We shall take the simple case of steady parallel motion, as down a 
tube or channel. Suppose that the motion is in the direction D (Fig. 
16.7): and suppose that at a point A the velocity is while at a neigh¬ 
bouring point B (where the line AB is perpendicular to the direction 



Fig. 16.7—Motion of a viscous fluid 


of motion) the velocity is + hv. Let us take the .v-axis of co-ordinates 
parallel to AB^ so that A has co-ordinate x, B has co-ordinate x Sx^ 
and therefore the distance AB = Bx\ Now draw planes III and TJ^ 
through A and B respectively perpendicular to the line AB. The 
viscosity will cause a shearing force in these planes. If Bv is positive, 
so that the fluid is moving more rapidly upwards at B than at Ay there 
will be an upward shearing force at B acting on the fluid benveen A 
and By tending to accelerate its motion, and a downward shear at A, 
tending to retardation of the fluid. In other words each portion of 
fluid tends to pull the neighbouring portions along with it. This shear 
S will of course be measured as force per unit area, i.e, as ne\nons/m* 
in M.K.S. units and as dynes/cm- in C.G.S. units. Now it is natural 
to suppose that S will be directly proportional to the velocity difference 
St?; a little thought will also suggest that it will be inversely propor¬ 
tional to the distance S.v, since the further apart the points A and B are 
the less effect the difference of velocity will have. Putting these sug¬ 
gestions together, we expect S to be a constant multiple of BvjBx, at 
least when Bv and S.v are small: or more precisely, we expect the shear 
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at a point A to be proportional to the value of dvjdx at that point. 
Experiment confirms that this is so: we can write 

S = 7 ) dvjdx .... (i6 .li) 

where t) is the measure of the viscosity. This can also be written 
= Sdxjdv. Now in M.K.S. units S is measured in newtons/m*, 
a: in metres, and v in metres/sec; -q has therefore the unit (newton/m^) 
X m X (sec/m) — newton . sec/m* = kg sec"^ m-h This unit is 
called the dekapoise. The corresponding C.G.S. unit is the poise = 
dekapoisc. 

We can now find at what rate a liquid will flow down a long thin 
circular tube. Let the length of the tube be L, and its radius R (see 
Fig. 16.8). For the sake of clarity the radius of the tube is greatly 



exaggerated in comparison with the length: the reader is asked to 
imagine that L is very much longer than is shown. We shall take the 
pressure at the nearer end of the tube (where the liquid enters) to be 
P i, and at further end (where the liquid leaves) to be P^- 

Now common sense suggests that the velocity v of the liquid at 
any point A within the tube will depend only on its distance r from the 
axis (supposing that the size of the tube and the pressures P^ and P., 
are given). Let us draw two cylinders, one of radius r around the axis^ 
and one of slightly greater radius r + 8r; at the surface of the inner 
cylinder the velocity will be v, and at the surface of the outer one it 
Will be + hv. There will therefore be a shear approximately equal 
to jj SvjSr, or accurately rj dvjdr, acting on the surface of the inner 
cylinder. Since the total area of this curved part of the surface is 
27 rrL, this means that there will be a total backward force ~27rrLrj dvfdr 
due to viscosity on the inner cylinder. For steady motion this must 
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be balanced by the difference of pressure at the two ends. Since the 
area of each end is tjt-, this forward pressure amounts to (Pi — 
so that —ztitLij dvjdr — (Pi or dvjdr = — ^(Pj — P^rjLT}. 

This is a differential equation which can be solved by direct integration; 
the solution is 

V = J {dvjdr)dr = — J ^(Pj — P^drlLr} 

= -\{P,-P^ViLyi +C 

The constant C is fixed by the fact that at the boundary of the tube, 
when r = R, the liquid sticks to the boundary wall, and so v = o. 
Thus C = i(Pi — P^R'jLy], and on substituting this value we obtain 

z- = i(Pi - P^XP^ - O/Lt?. 

Now let us find the total volume V of liquid passing through the tube 
in time t. Return to Fig. 16.8 and consider the cylindrical shell between 
the two cvlindrical surfaces of radii r and r + Sr respectively. Within 
this shell the velocity will be approximately v (neglecting the correcting 
term St). But the area of the ring at the end, which lies between two 
circles of radii r and r + 8r, will be the circumference of the circle 
times the width of the ring, i.e. 27 rr Br. So in time t the volume of 
liquid passing down the shell will be (velocity X time X area of end), 
i.e. znrvt Sr. The total volume passing down the whole tube will be 
the sum of the volumes passing down all such rings, i.e. T z-rrrvt Sr or 

more accurately f z-rrrvt dr. Since v — f(Pi — P^{R~ — 
integral becomes 

= fo K/’. - 

This is, by the way, less formidable than it seems, as most of the letters 
denote constants and can be taken outside the integral. Thus 

V ^ UL-^-V(P, - P,) /„^(PV - r^)dr 

== i 7 rL->T,-i/(P, „ 

= i77L-*7?-l/(P, - P,)P^ . . . (16.12) 

This is Poiseuille’s law. Expressed in the form rj = J7r/(P, — P^R^jVL 
it can be used to determine the viscosity experimentally. Typical 
values are: for water at 20'’ C, -01006 poise; for ethyl alcohol at 20® C, 
•0119; for air at 23^0, 1-830 X lO"^ poise. 

EXAMPLE 

(i) Brodie has used Poiseuille’s law to find the pressure necessary 
to drive the urine out of the kidneys. He found that urine was dis¬ 
charged from one kidney containing 142,000 glomeruli and tubules at 



PHYSICAL AND CHEMICAL MAGNITUDES 


459 

the rate of i cc per minute, i.e. at the rate of 1/(142,000 x 60) cc per 
second per tubule. Now Poiseuille’s law can be written 

P\ — P2 = ^LyiVjiTtR^ dynes/cm^ 

= 8Z/i7F/i3337r//?* millimetres of mercur>^ 

Putting t = ly V = 1/(142,000 X 60) cc for each tubule we can 
make the following calculations: 



Length L 

(cm) 

Radius R 

1 {io~* cm) 

Loss in 
pressure 
(mm Hg) 

Proximal convoluted tubule 

Loop of Henle — 

\ 

1-2 

i 

6 

15 

descending limb 

•9 

1 5 

23 

ascending limb 

1 -9 

1 4-5 

35 

Distal convoluted tubule 

•2 

9 1 


Collecting tubule 

' 2-2 ; 

8 ' 

9 

Total loss in pressure ' 

1 

' 

82 i 


Now as the mean aortic blood pressure was 120 mm I Ig, and the 
loss in pressure between the aorta and glomerular capillaries is probably 
35 mm Ilg, this gives a pressure of 85 mm Ilg in the glomerular 
capillaries, i.e. about enough to account for the secretion of urine. 
Brodie concluded that the pulsation of the glomerulus does not account 
for the flow of urine in the tubules, and attributed a secretory action 
to the glomerular surface. (But the bulk of opinion is against this 
inference.) 


16.18 Heat 

Heat is usually measured in units of i calorie. 'Phis has two varieties, 
the small or gram-calorie (cal.) defined as the heat required to raise 
1 gram of water through i*" C; and the large or kilogram-calorie (Cal.) 
which equals 1000 gram-calories. Since the heat required to raise a 
given mass of water from o'’ to i®C differs from that required from 
15® to 16”, or from 50'’ to si*’, a further specification is necessary to 
make these units perfectly definite. 'Phe most usual one is to take the 
gram-calorie as the heat required to raise i gram of water from 

gram-caloric”. A third kind of calorie is 
the “international steam table caloric” (I.T. cal) which is appro.xi- 
mately 1-0005 i5'’-gram-calories. 

Latent heats of fusion and vaporization are usually c.xpressed in 
units of I gram-calorie per gram, c.g. the latent heat of fusion of icc 
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is about 80 gm-cal/gm, and the latent heat of vaporization of water 
at roo° C is about 538 gm-cal/gm. 

Since heat is a form of energy, it can also be expressed in energy 
units, i.e. joules in the M.K.S. system or ergs in the C.G.S. system. 
The conversion factor, known as “Joule’s equivalent of heat” is (for a 
i5°-gm-calorie) 

J — 4*185 joules/cal — 4*185 X 10’ ergs/cal. 

Logarithms 

I calorie = 4*185 joules -62170 

I joule = *23895 cals 1-37830 

Since the definition of the calorie is, as we have seen, both arbitrary 
and somewhat indefinite, it seems rather surprising that it has not so 
far been generally replaced by the absolute unit, the joule. 

16.19 The method of dimensions 

Almost always a physical or chemical equation is expressed in a 
form which is independent of the system of units. Thus the equation 
for the volume of a cube of side L is V = L®, and for that of a sphere 
of radius is F = We do not need to specify the units employed 

in these formulas, provided that we understand that if L and R are 
measured in inches then F is expressed in inch®, if L and R are centi¬ 
metres then F is in cc, and if L and R are in metres then F is in m®. 
Similarly the distance a body falls in time t is it being imderstood 
that the appropriate value of g is to be substituted: and in the same way 
Poiseuille’s viscosity equation will hold good in any system of units. 

One reason why we write equations in this way is that it is con¬ 
venient to be able to choose the system of units at will. But the chief 
reason probably is that our units—such as the metre, the kilogram, and 
the second—are arbitrarily chosen and have little or no physical signi¬ 
ficance; and it would be absurd for such arbitrary units to obtrude 
themselves in an equation expressing a natural law. Thus we quite 
naturally express our equations in an “absolute” form, without taking 
any special thought in the matter. 

This property of physical equations, that they are true in any system 
of units, has a very important consequence. It implies that whatever 
units we use, the unit in which the left-hand side of the equation is 
expressed must be equal to the unit on the right-hand side. Thus an 
equation such as F — L® connecting length and volume can be ruled 
out at once as impossible: for F will be expressed, say, in metre®, 
while L- will correspondingly be in metre®, which is a different unit. 
Even if we suppose for a moment that w’e could find a system of units 
in which the relation F = L® was true, it would cease to be true when 
we changed the units. Actually of course we know that there is no 
system for which F = L® is true for arbitrary values of F and L: but 
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it does happen to be numerically true for the particular values L = 1 
(metre), V = i (metre^). However, when we change the units to 
centimetres, L becomes 100 cm, and V becomes 10® cm®, and is 
no longer equal to V. Thus the falsity of the equation is demonstrated. 

This equality of units provides a very useful check on equations: 
this check is known as the “method of dimensions”. Thus suppose 
we were uncertain whether the correct equation for the period t of 
oscillation of a pendulum of length L was t = Z 7 T\/{Llg) or 
t = 2n^{gjL). We try putting in suitable units, say M.K.S. ones. 
The time t will be expressed in seconds, g in m/sec®, and L in m. Thus 
2n\/(Llg) will be in units = \/(sec^) = sec, whereas 

^'"VigiL) will be in units of \/[(m/sec®)/m] = V(i/sec®) = i/sec. 
Only the first formula t = z-TT\/'{Ljg) gives the same units on both 
sides of the equation, and so it is the correct one. In fact it is possible 
to say rather more. If it is known that the period of oscillation of a 
pendulum is a function only of its length L (metres) and of the accelera¬ 
tion due to gravity ^ (metres/sec^), it can be deduced that the connecting 
formula must be f = K\/{Llg), where is a numerical constant. For 
no other combination of L and g has a second as its unit. Even if one 
did not know for certain that t depends only on L and g, the study 
of units would still strongly suggest the relation t = K\/{Llg). 

We have said above that physical equations are true independently 
of the system of units. This is not quite true without qualification; 
it is necessary that similar definitions should be used for derived units 
in different systems—a condition which is almost always satisfied in 
practice. To illustrate the point, imagine that a tribe is discovered 
in Central Africa, called the Ngboglus, who use as their measure of 
length the radius of a Sacred Jewel, which is perfectly spherical in 
shape, and that as their unit of volume they use the volume of the 
Sacred Jewel. In their units the volume of a sphere of radius R is no 
longer grrT?®, but is now f?®; and the volume of a cube of side L is for 
them The reason is simply that they have a different definition 

for the derived unit of volume: we use a unit cube, they use a unit 
sphere, and because of that their equations are different. But since 
all commonly accepted systems of units use the same methods of 
defining derived quantities, this particular complication' can be ignored 
(unless of course one wished to do business with the Ngboglus). 

x 6 . 2 o Dimensional constants 

We can always select arbitrarily a unit to measure any physical 
niagnitude. Thus we can take the metre, or the inch, or the height of 
Mount Everest, or the mean distance from the earth to the sun as a 
unit of length; and we can take the litre, or the gallon, or the fluid 
ounce, or the Sacred Jewel of the Ngboglus as a unit of volume. Of 
course a litre is defined officially in terms of a kilogram, but only 
through the physical properties of water at a special temperature; in 
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effect it has an arbitrary definition. Units chosen in this way we shall 
call “independent”: thus the metre, kilogram, second, litre, calorie and 
degree centigrade are all independent units. In any system of measure¬ 
ment there will be a number of independent (or “fundamental”) units, 
and other units will be derived from these. 

Now if we take independent units for energy E and heat Hy we find 
that heat and energy are interconvertible according to the rule, i unit 
of heat = J units of energy. Thus on taking i gram-calorie as the heat 
unit, and i joule as the energy unit, we find i gram-calorie = 4*185 
joules. Here the constant J expresses the number of joules for each 
gram-calorie, and therefore expressed in its correct units it is 4*185 
joules/gm-cal. If instead we t^e i kg-cal as unit of heat, and i erg 
as unit of work, we find that J ~ 4*185 X 10^® ergs/kg-cal. This 
follows from the usual rule for conversion of units 

4185 joules/gm-cal = 4*185 X (10^ ergs)/(io~® kg-cal) 

= 4185 X 10'® ergs/kg-cal. 

In general H units of heat are equal to E units of energy, where E — JH. 
Note that this equation is dimensionally correct: E is measured in units 
of, say, I joule, while JH has the units joule/gm-cal x gm-cal = joule. 
In this equation ^ is a universal physical constant. That is to say, it 
has a fixed value independent of the place, time, and conditions of 
measurement, or so experimental determinations seem to show. It is 
also a dimensional constant, i.e. its numerical value depends on the 
units of measurement. 

Now instead of measuring heat in terms of the arbitrary unit of 
I calorie we can use instead the joule. This is a natural unit to take, 
since we believe that heat is a form of energ}% and the joule is the 
M.K.S. unit of energy. In these units the equation for converting heat 
into energy becomes simply E = //; the constant has now disappeared, 
or rather it has become unity. As we have said, this is physically 
reasonable; but the possibility of making this change follows purely on 
mathematical grounds, without any appeal to physics. All that is 
necessary is to replace the arbitrary unit, the calorie, by a new unit 
equal to ijj calories. In this new unit a quantity of heat = H calories 
will become H' = JH units, so that the equation E = HJ will become 
E = H'. In general any equation containing a dimensional constant 
can, by means of a suitable change of units, be replaced by a new 
equation without the constant: the effect is to replace one independent 
unit by a new derived unit. Thus, if gy the acceleration of gravity, 
was an absolute constant on the earth’s surface, we could change our 
units so as to make it equal to i. The simplest way would be to replace 
the metre by a new unit equal to g metres. The equation y = \gt^ 
for the distance a body falls in time t would be replaced by y = 
and many other dynamical calculations would be greatly simplified. If 
we measure the time t in seconds, the equation y — shows that 
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we should then measure distance in square seconds—a somewhat 
unfamiliar idea: i “square second” would be by definition g metres, 
or twice the distance a body falls in i second. In this system mass 
and force would be expressed in the same units: i kilogram weight 
would be equal numerically to i kilogram. However, since g is not 
constant this substitution is not possible in practice. 

There are two ways of looking on the formula E — JH relating 
heat and energy. Either we can say that there is a “universal constant 
of nature” J which relates heat to energ}% or we can look upon the 
constant J as arising because we are so perverse as to measure the 
same quantity, energy, in two different units. It is at least possible 
to argue that most dimensional constants arise from such perversity. 
Thus the velocity of light c is one such constant. If we accept the 
Special Theory of Relativity we see that time and space are merely 
two aspects of the same thing, and so distances and times are really 
quantities of the same kind. Since we use different units, the metre 
and the second, for distances and times respectively, we can expect 
the appearance of a universal constant c which is really no more than 
the ratio of the two units. However, the question of whether two 
physical quantities are “really” of the same nature, or “really” difi'crei.t, 
IS a fine philosophical point on which one can argue interminably. 'I'he 
question cannot be settled on the basis of the units customarily used to 
measure the quantities, which are largely conventional. Heat is usuallv 
expressed in different units from other forms of cnerg)', although we 
believe them to be essentially the same. On the other hand the moment 
of a force and the work done by a force are both measured by a product 
force X distance. Hut few people would suggest that a moment was 
a kind of work. 'I'he only certainty is that a dimensionless constant, 
J.e. one independent of units, cannot be eliminated, and must therefore 
express a real natural law. An example would be the ratio of the 
electrical force between two electrons to the gravitational force calculated 
from their masses by means of the universal law of gravitation. 

16.21 Atomic weights 

Atomic weights measure the relative masses of the atoms of different 
elements. There arc unfortunately two different units: on the “chemical 
scale” the atomic weight of atmospheric oxygen is taken as 16 exactly. 
This is really the mean atomic weight of the different isotopes contained 
m it. On the “physical scale” the isotope O'® is taken as standard, 
having an atomic weight of 16 exactly, and atmospheric oxygen then 
has the weight 16 0044. I hus the physical atomic weight is obtained 
from the chemical one by multiplication by 1*000272; for most j>urposes 

this correction is negligible. In what follows we use the chemical 
scale. 

The atomic weights and atomic numbers of the commoner elements 
are given in 'Fable 16.3. 
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Table 16.3—Atomic weights (chemical scale) 


\ 

Element 

No. 

At. wt. 

Element 

No. 

At. wt. 

Aluminium 

% # 

Al 

13 

2697 

Lithium 

# 4 

Li 

3 

6*94 

Antimony 


Sb 

51 

121*76 

Magnesium 

4 4 

Mg 

12 

24*32 

Argon . . 

• # 

A 

18 

39-94 

Manganese 

4 4 

Mn 

25 

54-93 

Arsenic 

# 4 

As 

33 

74-91 

Mercury 

4 4 

Hg 

80 

200*61 

Barium 


Ba 

56 

137-26 i 

Molybdenum 

4 4 

Mo 

42 

95-95 

Berj’llitim 


Be 

4 

9-013 

Nickel .. 

4 • 

Ni 

28 

58*69 

Bismuth 

• ^ 

Bi 

83 

209*00 

Nitrogen 

4 4 

N 

7 

14*008 

Boron . . 

4 4 

B 

5 

10*82 

Oxygen 

4 4 

0 

8 

i6*ooo 

Bromine 

« 4 

Br 

1 35 

79*92 

Phosphorus 

4 • 

P 1 

15 

30*98 

Calcium 

% 4 

Ca 

1 20 

40*08 

Platinum 

4 4 

Pt 

78 

195-23 

Carbon 

4 4 

C 

! 6 

12*01 

Potassium 

4 4 

K 

19 ; 

39-096 

Chlorine 

4 % 

Cl 

17 

35-46 

Selenium 

4 4 

Se 

34 

78*96 

Chromium 

4 4 

Cr 

24 

52*01 

Silicon 

4 4 

Si 

14 

28*06 

Cobalt .. 

s % 

Co 

27 

58*94 

Silver .. 

4 

Ag 

47 

1 107*88 

Copper 

• 4 

Cu 

29 

63*54 

Sodium 

• 4 

Na 

11 

22*997 

Fluorine 

4 4 

F 

9 

19*0 

; Strontium 

9 9 

Sr 

38 

87-63 

Gold .. 

4 4 

Au 

79 

197*2 

Sulphur 

4 4 

S 

16 

32*066 j 

Helium 

4 4 

He 

2 

4*003 

Tin .. 

4 4 

Sn 

SO 

118*70 1 

' Hyorogen 

9 m 

H 

I 

I *008 

Titanium 

4 4 

Ti 

22 

47-90 

; Iodine . . 

4 

I 

53 1 

126*92 

Uranium 

9 4 

U 

92 

238*07 

, Iridium 

9 9 

Ir 

77 

1931 

Vanadium 

9 4 

V 

23 

50*95 

1 Iron 

9 4 

Fe i 

26 

55-85 

Wolfram* 

4 4 

W 

74 

183*92 

1 Lead 

1 

i 

• % 

Pb 1 

1 

82 

207*21 

Zinc 

4 4 

Zn 

30 

65-38 

1 


• Or Tungsten. 


From these atomic weights we obtain the chemical unit of mass, i.e. 
the mole or gram-molecule, defined as the molecular weight in grams. 
(Presumably the corresponding M.K.S. unit would be the “kilomole’".) 
The concentration of a solution is usually expressed in units of i molar 
concentration (i mole/litre = i kmol/m®, very nearly) or normal (N) 
(= I gm-equivalent/litre, i.e. molar concentration divided by valency). 

16.22 Temperature: specific heats and conductivities 

T'he universal unit of temperature is the degree centigrade, of the 
difference in temperature between melting ice and boiling water at 
76 cm of mercury pressure. Other scales of temperature are the 
Fahrenheit, Reaumur, and Absolute (or Kelvin) scales; these are con¬ 
nected by the relations that a temperature will be measured as tc° on the 
Centigrade scale, /p'’ Fahrenheit, tp° Reaumur, and Absolute, where 

/c = i (^F - 32) = Hr *= - 273*16 . . (16.13) 

The absolute temperature ?k is usually denoted by the symbol T. 
It can be exactly defined by means of the second law of thermodynamics 
(see text-books on the subject) and is very nearly proportional to the 
pressure of a given mass of hydrogen at constant volume. 

The specific heat of a substance is the quantity of heat required to 
raise unit mass through unit temperature. In the C.G.S. system the 
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unit will be i gTi-calorie per gram per degree Centigrade. Accordingly 
pure water at 15° C has unit specific heat, by the definition of the gram- 
calorie. For other substances typical values are: iron, 10; lead, 03; 
glass, -12 to -19. 

If the temperature within any body is not uniform a conduction of 
heat will take place from the hotter parts towards the cooler. In parti¬ 
cular, consider a slab of uniform thickness b and area A, and let one face 
be maintained at a temperature T'j and the opposite face at temperature 
T 2. The rate of conduction of heat is then directly proportional to the 
difference in temperature Tj —and to the area A of the slab, and is 
inversely proportional to the thickness b. The amount of heat H passing 
through the slab in time t will therefore be 

H = c{T^~T^t Ajb . . (16.14) 

where c is a constant, called the “ conductivity ” of the substance, and 
measured by the heat passing through a unit cube in unit time when 
unit difference of temperature is maintained between the opposite faces. 
The ordinary unit of conductivity is therefore the gm-cal cm~^ sec“'. 
Values of the conductivity for typical substances are (in gm-cal. cm“* 
sec“* deg-^) copper, -176, glass 002, wood 005, water at 10® C, 0015. 

EXAMPLE 

(i) E. G. Glaser {Nature, 166 (1950), 1068) performed some experi¬ 
ments on immersion in cold water. 'I’he subject of the experiments was 
a man aged 36, of height i -8o metres, weight 85 kg, and therefore pre¬ 
sumably with surface area 2 05 m- by the Dubois formula. It was 
assumed that his surface tissues would conduct heat at the rate of about 
10 kgm-cal. per square metre per hour per temperature-difference 
between his deeper tissues (about 37® C) and the surface. At a water 
temperature just above freezing point the rate of loss of heat should be 
about 2*05 m- X 37® X 10 kg-cal. m-^ hour“* °C-', i.e. about 12-5 kilo¬ 
gram-calories per minute. Assuming further that his specific heat was 
about -83 kilogram-calorics per kilogram per degree centigrade, this 
should result in a fall of temperature of the order of 12-5/(85 x -83) 
degrees per minute, i.e. about -18 degrees per minute. Further experi¬ 
ments showed that while the natural rate of heat production was only 
1-2 kilogram calories per minute, this increased to about 12-5 with 
moderately hard swimming, i.e. sufficient to counterbalance the loss of 
heat due to the coldness of the water. (So if one falls into cold water 
the best thing to do is to keep swimming, and so keep warm.) 


The law connecting temperature, pressure and volume of a *' perfect 
gas ” is PV = nRT, where n is the mass of gas expressed in molecular 

fill 
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weight units, and R is the “ gas constant **. Although real gases do not 
exactly obey this law, most of them approximate very closely to it at a 
sufficient dilution. The value of R in various units is 

Logarithms 


8-3142 X 10’erg deg“^ mole"^ 7-91982 

8-3142 X 10® joule deg“'kmole”^ 3-91982 
82-061 cm® atmos deg”* mole”* 1-91414 

1-9865 cal. deg”* mole”* -29809 

654-06 cm® cm-Hg deg~* mole”* 2-81562 


The degree is of course an arbitrary unit of temperature. If we wish 
we can obtain a derived unit in which the gas constant R = i; it is only 
necessary to take {ijRY as the unit, i.e. the derived temperature unit in 
the C.G.S. system is 1-2027 *** M.K.S. system it is 

1-2027 lO”* °C. This unit does not seem to have ever been used in 
practice. It has the curious property that it makes the specific heat per 
molecular weight unit a dimensionless constant. In fact twice this speci¬ 
fic heat is usually very nearly a small integer, representing, according to 
the kinetic theory of heat, the number of different ways in which a mole¬ 
cule can move or rotate (the “ degrees of freedom ** of the molecule). 
Also in these units the two specific heats of a gas, at constant volume and 
at constant pressure respectively, differ by 1 for a molecular weight unit 
(instead of R/J in the usual formula). 

16.23 Chemical reactions 

If a reaction is of the form molecule o( A-\- molecule of B-^ reaction 
products i? + 5 , then Guldberg and Waage^s “ law of mass action ** 
states that the velocity of the reaction is proportional to the product of 
the concentrations of A and of B, i.e. 

reaction velocity = KlAl [B] 

where [A'] and [ 5 ] stand for the concentrations of A and B respectively. 
At any one particular temperature K is constant (and is known as the 
“ reaction constant ”). It usually increases exponentially with tempera¬ 
ture. 

Consider for example the inversion of cane sugar 

^12^22^11 "b II 2 G —> CgHjoOe + CeH220g 

Dextrose Lacvulose 

Then the reaction velocity = ATfCigHooOi,] [HgO]. If the reaction 
begins with a very dilute solution the concentration of water will be 
nearly unity (in suitable units) and will remain so throughout the reac¬ 
tion. 

Let a: = [C1 gH 2 2G11] denote the concentration of cane sugar at time 
t from the start of the reaction. Then the reaction velocity will be equal 
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to the rate of decrease of .v, —dxjdt = —and according to the law of 
mass action this is approximately Kx^ i.e. 


.Vf = — Kx. 

This differential equation has ‘‘ separable variables (case A, Section 
10.9). It is therefore solved as follows: 

Since dxjdt — — Kx, dtjdx = — ijKx, and on integration with 
respect to j:, / = In ^ + C. Now if the initial concentration 

when / = o is Xq, we have on substitution 


o = —K Mn atq + C, i.e. C = AT”* In Xq. 

So / = ]nx -h K-^ In ato 

= iC“* In (aTo/a-). 

and therefore K = In {xjx). 

This can be checked by measuring the concentration .v of cane sugar at 
different times t, and calculating i~^ In (.v„/.v). If our theory is correct, 
all such values should be equal, and their common value is the reaction 
constant K. 

The following table shows some experimental results: 


t (in minutes 
from start) 

xjx ^ 

t ‘ In (.Vo/y) — K 

0 

1 

I 


1.435 

•58240 

2 ' 7(>2 X io"< 

4 . 3*5 

*9530 

3785 X lO-^ 

7,070 

•07008 

3 759 X 10-^ 

11,360 

•01484 

3-707 X IO~^ 

i 4.*70 

■00534 

3-692 X lO'^ 

* 6.935 

•00185 

3716 X IO“‘ 

*9.8*5 

•00069 

368 X IO~^ 

29.925 

1 

•00001 

38 X 10-^ 


The constancy of K verifies the theory. 

Note that for a unimolccular reaction the equation t = K-^ In (.v„/.v) 
shows that when .v = Uo, i.e. when the concentration is halved, 't - 
K-^ In 2 = a constant independent of the initial concentration .v„. 
T'his can be used as a test for the unimolccularity of a reaction. For 
pample when diphtheria antitoxin was injected in various strengths 
into the bodies of animals Ilornstein found that the quantity rernainino- 
after 4 days was a constant fraction of the original amount, thus suggest¬ 
ing a unimolccular reaction. He confirmed this by plotting log .v (logar¬ 
ithm of antitoxin concentration) against t (time elapsed), obtaining a 
straight line as indicated by the formula Kt = In {x^jx) = In .v — In v 
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If a reaction is of the form zA -f B i.e. A A + B -> 

R -{• S then the law of mass action indicates that the reaction velocity 
must be proportional to [A] [A] [B] = [Ay [B]. Suppose that A is 
present in a very small concentration [A] — Xy while that of B can be 
taken as practically constant. Then the reaction velocity will be 
—dxjdty and will be of the form kx~y where A is a constant. We have 
now to solve the differential equation 

dxjdt = — kx^ 
i.e. dtfdx = — k~^ x~^ 

whence on integrating with respect to Xy 

t = k-^ x-^ -h C. 

It follows that for such a reaction we obtain a straight-line graph on 
plotting .v-^ against t (see Section 7.3). Similarly, a trimolecular reac¬ 
tion should give a straight line on plotting x~^ against t. 

Now consider a reversible reaction, say one of the form 

zA B ^ ^R S 

'Then the forward reaction will proceed with velocity ATj [A]^ [B], where 
AT, is its reaction constant, and the backward reaction will have a velo¬ 
city K2 [B]^ [B]. When the equilibrium point is reached these velocities 
must be equal, i.e. 

K, [AY- [B] = K2 [RY [S] 

whence [yl]- [B]/[/?^] [B] = /Co/ATj = a constant, known as the “ dis¬ 
sociation constant ” for the reaction. In general if the reaction has the 
form 

a . A b . B -f- . . , ^ T . R s . S - • - 
(where a . A means “ a molecules of A ”) we shall have 

[/!]“ [B]" . . ./[/?]'■ [B]^ ... — a constant k. 


T’hus suppose that the reaction between oxygen and haemoglobin 
has the form 

Hb+H0,^Hb02 
Then when equilibrium is established 

= dissociation constant k. 

[Hb][OJ- 

Denote the concentration [O2] of oxygen by x: then this equation gives 

[HbOaMHb] = Ax'* . . . (16.15) 

Adding i to each side we have 

([HbO,] + [Hbl)/[Hb] = I + A.V” . . (16.16) 
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Divide equation (16.15) t>y (16.16): this gives 

[HbOa] kx- 

[HbO^] + [HbJ 


I H- 


^ ^ loorHbOgl , 

15ut y — j I the percentage s3tur3tion of 

haemoglobin with oxygen. This establishes A. V. Hill’s equation 

ioo/iA"/(i 4 -A.v'*) . . ( 16 . 17 ) 


16.24 Electric units 

Electrical units can be related to the units of length, time, and mass 
in three distinct ways— 

(а) using the force between charges at rest; 

(б) using the force between currents; 

(c) using electrolysis, i.e. the decomposition of a chemical substance 

by the passage of an electric current. 

* 

These lead to three possible systems of electric and magnetic units. 
Those derived from (a) are known as “ electrostatic units ” (e.s.u.), 
those from (A) are “ electromagnetic units ” (c.m.u.), while (c) does not 
seem to have been ever seriously considered. None of these units have 
ever been used in practice, as they are of a quite inconvenient order of 
magnitude—although a hybrid mixture of electrostatic and electro¬ 
magnetic units (known as Gaussian units) is found in many text-books 
on theory. To fill the gap a further system of “ practical ” units has 
grown up. 

The student will find this complicated situation in many books. 
However as long ago as 1901 G. Giorgi showed that it was possible to 
obtain a system of units which actually simplifies the theoretical relations 
and also includes the practical units as a special case. In 1935 this 
Giorgi system was officially adopted by the International Electro¬ 
technical Commission. A detailed discussion would be outside the 
scope of this book: a good elementary account will be found in E G 
Cullwick’s book The fundamentah of electro-magnetism (C.U.P. 1930) 
(although some of Cullwick’s arguments are a little controversial) It is 
true that electrical theory is not often used in biology, but it may be 
worth while giving a brief resume here to show how the different aspects 
are quite simply linked together. It also illustrates an important type 

of differential equation (equation 16.30) which may occur in other con¬ 
nections. 

The Giorgi system is based on four units: the metre kilogram 
second and coulomb as units of length, mass, time and electrical charge 
respectively (There are other ways of choosing the four fundamental 

units, but they are equivalent to these.) The coulomb can be defined 
formally by the property: 



470 


BIOMATHEMATICS 


“ Two equal charges of one coulomb placed in empty space at a 
distance of i metre apart will repel one another with a force of 8 988 
X 10® newtons. The constant 8-988 X 10® is chosen to be iO“’ c®, 
that is, numerically equal to one ten-millionth of the square of the 
velocity of light measured in metres per second.*’ 

In general two charges of and coulombs respectively when placed 
at a distance r metres apart will repel one another with a force 

/= netnons . . (16.18) 

where /cg — 1/(8-988 X 10®) = = 11126 X coulomb® 

joule“^ m~^. This is the inverse square law for the force between two 
electric charges. The constant 8-988 X 10® which is used to define 
the unit of charge, the coulomb, may seem at first sight to be rather 
oddly chosen, but in practice it turns out to be rather convenient. The 
reciprocal constant kq = 1-1126 x lo-^® is kno™ as the “permittivity 
of free space From this inverse square law it is possible to deduce 
the forces acting between any systems of electric charges at rest. 


PROBLEM 

(i) Show that if any three unknown charges q^y q^ and q^ are given, 
it is theoretically possible to deduce the value of ^1, q^y q^ by measuring 
the forces between each pair of charges when placed in empty space— 
except that it is not possible to distinguish between the system of 
charges q^, q.y, q^, and the system — —q2, —q^. 


Aote I he equation / — ''"ill also hold in any other 

system of units, provided that the appropriate value of /cg is taken, 
according to the_ usual rule for change of units (Section 16.8). The 
relation /cg -- lO' c- will no longer necessarily hold. 

Electricity can be caused to move by various means, including simple 
mechanical transportation, electromagnetic forces (as in lightning 
flashes, dynamos), chemical action (as in cells, accumulators, nerve 
cells), heat (thermoelectric couple), and light (photoelectric cell). A 
steady flow ot electricity is called a current, and is measured in coulombs 
per second, or amperes. More generally, if q coulombs have been trans¬ 
ported down a wire after t seconds, the current will be measured by the 
rate of change of that is, dq'dt amps. 

Like everything else electricity will only move if it is urged on by a 
force, and so as it moves it must do work.' I’his work will generally be 
dissipated as heat, but can also be used chemically or mechanically. 
1 hus suppose a charge is acted upon by a force with componen f along 
its line of motion; then in moving from a point A to a point B it will 
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gain energy equal to J^ f dL^ where L represents the distance 

travelled along its path (Section 11.7). In particular, if O is a charge 
with fixed position in empty (or nearly empty) space, and 9 is a movable 
charge, at a distance r from it follows that the force on 9 is / = 
Qqlxf^r^. It follows that if q moves outward from a distance r = a to a 
distance r = 6 it gains energy equal to 

L \a bj 


This energy can be used to do work. 

Notice that the energy gained by the charge q (which can be spent 
in doing external work) is proportional to q. That is only to be expected: 
2 coulombs will gain twice as much energ>' as one coulomb in moving 
from A to B, provided that the rest of the system is the same in both 
cases. The amount of energy supplied to a unit charge moving from A 
to B is called “ potential drop ” V between A and B: the energ\’ sup¬ 
plied to a charge q will therefore be qV. For example, the potential drop 
between a point A at distance a from a charge Q and a point B at dis¬ 


tance b is (potential of A — potential of B) = 



The unit 


of potential drop is accordingly i joule per coulomb, named i volt. 

It follows that if a steady current of I amps = / coulombs per sec¬ 
ond is passing down a wire along which there is a potential drop of V 
volts, it will be given energy at the rate of IV joules per second, that is, 
IV watts. This energy will be dissipated as heat at the rate of ]V = IV 
joules per second, or IVfJ calorics per second. 

It is an experimental fact that in most conductors the current I is 
exactly proportional to the potential drop V (Ohm’s law). Thus we 
can write 

V — IR . . , . (16.19) 


The constant of proportionality R is called the “resistance ’’ of the cir¬ 
cuit. It is measured in volts per ampere, called ohms. Combining this 
relation with the relation W — IV for the rate of production of heat 
energy we have 

IV= VyR : I^R . . (16.20) 

It can be readily shown from Ohm’s law that conductors with 
individual resistances R^, R^, 7?3, etc. will give a total resistance 
7^1 -b 4 ' /^3 -b • . . when joined in series, and a resistance equal to 
( 7 ?i“' -b 7?2”‘ + 7?3”* -b . . .)”* when joined in parallel. 

The total drop in potential summed over all the wires in an electric 
circuit is called the “ electromotive force ’’ or “ e.m.f.’’ E in the circuit. 
It follows that if I is the current flowing round the circuit, and R is the 
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total resistance, + i?2 + -^3 + ■ . then E = IR; and the total 
rate of heat production is W = El. 

When a current passes through an electrolytic cell there is a simple 
proportionality between the amount of substance decomposed and the 
total quantity of electricity passing through the cell, according to the rule 
that 96,490 coulombs are required to decompose i gram-equivalent, or 
9 6490 X coulombs to decompose i kg-equiv. This constant of 
proportionality is known as the faraday^ F. The values of the funda¬ 
mental constants can be summarized thus: 

Logarithms 

velocity of light c — 2-998 X 10® m/sec 8-47683 

permittivity of vacuum 

#co— 1-1126 X lO”^® coulomb® joule”^ m*“^ 10 04634 

faraday F = 9 649 X 10’ coulombs/kg-equiv. 7 98448 

16.25 Dielectrics: electric capacity 

The inverse square law/ = y, q^jK^r- is valid only in empty space. 
If the charges and 92 2re imbedded in a material body then the charges 
inside the atoms of this body will be disturbed and redistribute them¬ 
selves. The net effect will be to replace the law by 

/= ?i 92 /(^«o''") • • • (16.21) 

where iC is a constant characteristic of the material concerned, and 
known as its “ dielectric constant ” or “ relative permittivity It is a 
pure number: typical values are 1-00058 for air at normal temperature 
and pressure, 4 to 10 for glass, 1-7 to 2-9 for india-rubber, 4 to 8 for 
mica. 1 he product Kk q is the “ absolute permittivity k. 

T his fact is made use of in constructing condensers. When a body 
is raised to a potential V volts, it will acquire a small charge g coulombs, 
proportional to V. We write 

q ~ CV .... (16.22) 

where C is the capacity ” of the body measured \n farads or coulombs 
per volt. A farad is too large a unit for ordinary use, so that most 
capacities are measured in microfarads {ixF) = lo"® farads. 

A condenser is a device to hold small quantities of electricity in 
a compact space. *1 he usual form consists of two parallel plates of area 
A (square metres) separated by a thin layer of mica, or similar insulator, 
of thickness S (metres) and dielectric constant K, Such a condenser has 
a capacity C = Kk farads, i.e. when a potential difference of V 
volts is applied between the two plates, it holds a charge of CF coulombs, 
(This is a consequence of equation 16.21, but we shall not go through 
the details here.) In a similar way, if a condenser is formed by two con¬ 
centric cylinders of radii /?, (inner cylinder) and R„ (outer cylinder) 
respectively, and of length L (metres), the space between the cylinders 
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being filled with an insulator of dielectric constant K, the capacity will 
be approximately \KkqL In {R^jRi) farads. This may be relevant to 
the transmission of a signal along a nerve fibre, which has a very similar 
construction. 

16.26 Magnetic units 

The inverse square equation (16.18) represents the force between 
two stationary charges. When either or both of the charges are in 
motion there are additional forces between them. 



In (Fig. 16.9), let the distance between the two charges be r (metres). 
Let the charge move with a velocity (metres per second) and an 
acceleration (metres per second^), and let move with a velocity v^. 
Let a be the angle between and the line joining the charges, and 
further let n be the vector perpendicular to the plane containing the 
direction and the two charges, and ^ the angle between n and 
the direction of motion 

Then in consequence of the motions of the charges 5 2 is acted upon 
by a force 

/ = <71 $2 ^’1 ^2 sin a sin newtons . (16.23) 

perpendicular to n and to Put into simpler language this e.xpression 
means that this “ magnetic force ” tends to curve the track of the charge 
^2 without altering its speed. Furthermore if and q^ are of the same 
sign, and if when we look at the whole system along the direction n the 
charge q^ appears to be moving around ^3 in a clockwise direction, the 
effect will be to curve the path of q^ in an anticlockwise direction (and 
vice versa. If the charges are of opposite signs it will be found that either 
both will be clockwise, or both anticlockwise.) To take a simple case 
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consider two charges moving side by side in parallel paths, so that 
a = jS = 90°; the force equation then reduces to 

f = . . (16.24) 

where /io= io“’ ne\\tons secVcoulomb^. Equation (16.24) 

means that there will be an attraction between charges moving parallel 
to one another. It follows that two parallel wires carrying currents in 
the same direction will attract one another, and so will tw'o parallel coils 
or electroniagneis. The same effect occurs in a permanent magnet, 
where the electrons in the iron or other magnetic material all spin 
parallel to one another, and by such spin behave very similarly to coils 
carrying currents. 

If the charges are not in free space but embedded in a material 
medium, the forces in equations (16.23) (^6.24) must be multiplied 

by a factor ^ called the “ permeability ” of the medium. 

Besides this “ magnetic ” force produced by the velocities and Og 
of the charges, there are still further forces produced by their acceler¬ 
ations. The force on produced by the acceleration of is 

% 

/= sin ^/r . . (16.25) 

where «/»is the angle between the direction of the acceleration and the 
line joining the two charges; and / is directed at right angles to this 
line, in the same plane as and its component along the direction of 
is opposed to the acceleration. In other words, whenever an electron 
accelerates in a forward direction, all other electrons in the vicinity 
experience a backward force inversely proportional to their distance, 
and directly proportional to the acceleration in question. This effect is 
known as “ electromagnetic induction ”. (It should perhaps be added 
that all our equations, such as (16.18), (16.23) (^6.25) are approxi¬ 

mations which are very closely obeyed for ordinary velocities, but are 
subject to correction for velocities approaching that of light. There is 
also a small additional force due to the combined effect of the velocities 
and accelerations, but this can normally be neglected.) 



Fig. i6.io—Electromagnetic induction between coils 

Induction is the basis of the electrical transformer. Consider two 
coils of wire, IF, and IF2 (Fig. 16.10) placed close to one another. 
Through we imagine a current I amps to flow. Suppose now that 
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this current is increased: then the electrons in IVi will be accelerated, 
and consequently those in 1^2 "'*^1 experience a backward force. Ex¬ 
pressed in another way, there will be a back electromotive force E in coil 
IV2 proportional to the acceleration in PFj, i.e. to the rate of change of 

current dljdt = I = /,. 

Thus we can write 

E = — Mdijdt . . . (16.26) 

where iV/ is a constant, called the “ coefficient of mutual inductance 
It is measured by the e.m.f. in volts induced in IV2 by a change in current 
in IVi of I amp per second; i.e. the unit of inductance is i volt. sec/amp 
= 2 henry. 

We have spoken above of different coils and IF2: but it is by no 
means essential to our argument that the coils should be different. The 
same results apply to a single coil, which has the property of resisting 
any change in current by producing in itself a back electromotive force 
E volts given by 

E= —L dljdt . . . (16.27) 

Here L is the “ coefficient of self-inductance ” and forms a sort of 
electrical inertia of the system, which resists any sudden change of 
current much as ordinary mass resists any sudden change of velocity. 

EXAMPLE 

(i) The oscillating circuit (Fig. 16.ii). Suppose we have a circuit 
containing a resistance R (ohms), a condenser (capacity C farads) and a 


Reiistance R 



Fig. x6.xi—An oscillating circuit 

coil {L henries self-inductance). Suppose that at time t the condenser 
contains a charge q coulombs, so that the potential difference V between 
its plates is qjC volts (by equation 16.22). Furthermore the current I 
amps in the circuit will be produced by the emptying of the condenser, 
so that 7 is minus the rate of change of i.e. 

/ =z= —dqjdt = —Dtq 


(16.28) 
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The whole system will now act very like a simple pendulum. The cur¬ 
rent will try to flow from one plate of the condenser to the other, but in 
doing so it will be damped by the resistance and hindered by the coil 
which will oppose any change in the current. By the time the current 
has really got going the condenser will be empty: but the current cannot 
stop itself owing to the presence of the coil, and so the condenser will be 
charged up again in the opposite direction before the current stops. The 
cycle of events then repeats, and we get a series of oscillations. 

It is easy to investigate this mathematically. The back e.m.f. due to 
the coil is —LDJj i.e. LDt^q (sometimes written Lq^ where the two dots 
denote the second derivative with respect to time). The forward e.m.f. 
due to the potential difference in the condenser is j/C, so the net e.m.f. 
in the circuit is qjC + Lq volts. By Ohm*s law this must be equal to the 
current (—Diq = —g) times the resistance R, i.e. 

qjC -\- Lq = — qR 
or Lq + H- qJC = o 
i.e. LD^q + RD^q -}- qlC — o 

or Lqtt + Rqt + qjC — o ... (16.29) 

according to the notation employed. This is a linear differential 

equation of the second order with constant coefficients ”. We have 

already considered special cases. If Z. = o, and w’e have no coil, the 

equation reduces to Rq + qjC = o, i.e. q = g, = —g/RC, and this has 

the solution g = an exponential decay like Newton's law of 

cooling or the decay of a population under unfavourable conditions. 

If R ~ o, i.e. there is no resistance, the equation becomes Lij + qJC = o, 

i.e. q = qt, = — oj^q where cu = ijLC. This corresponds (by Section 

15.2) to steady periodic oscillations of period ztt/w = 27r-\/(Z.C): the 

solution is q — K cos (wt + ^) where K and ^ are arbitrary constants. 

In the general case in w'hich L ^ o we proceed as follows. We first 

find the values of A (a constant) for which q — is a possible solution 

of (16.29). Since in this case qi — Ae^t, q^^ = A*e^t, and substitution in 
(16.29) gives 

(LA2 -i. RX + liOe^t = o 

or LA- 4- RX -I- i/C = o . . . . (16.30) 

since ^ o. Now by hypothesis L 9^ o, so this is a quadratic equation 
for A with at least one root and probably two distinct roots 

^2 - [-R i ^'(R 2 _ 4i/C)]/2L 

which may hotvever be real or complex. We see that A^ + Ag = —RJL, 

i.e. 

Aj = Aj -f (Aj 4- A,) — 2 Aj 
“ Aj (2A1 4 " RjL) 


(i6-3i) 
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SO that the roots Aj, A2 are distinct if and only if 2A, -f- RjL o. Now 
let q be any solution of the equation (16.29). Write Q = so that 

q = .... (16.32) 

Then qt = Qte^'‘ 4 - QXie^>i 

and substitution of these values in equation (16.29) gives us simply 

L\Qtt 4 - (2A1 4 - — o 

as all the other terms cancel, and since L ^ o and ^ o this gives 
us the equation 

4 -(2A1 4-i?/L) 5 ( = o . . (16.33) 

If now we can solve this equation for Q we can substitute back in 
(16.32) and find the general value of q. But this is quite simple. There 
are two cases to consider. 

Case I. 2A| 4- RjL = o, i.e. the quadratic (16.30) has only one 
root Aj = —RjzL (and therefore necessarily real). 

Equation (16.33) then reduces to Q,t = o, or on integrating with 
respect to 

0, = / Q„ dt = C„ f Q, dt = C,t + C„ 

where Cj and are constants of integration. By (16.32) the solution is 

q = {C,t + 

= {C,t + . . . (16.34) 

Thus the current decays steadily and rapidly to zero with no oscillations. 
Case II. 2A1 4- RjL ^ o. 

Write 0 — Qt, then (16.33) becomes = —(2A, 4- i?/L) 0 , which 
has the solution tp = Q, = On integrating with respect to 

t this gives Q = CaC-t^A.+zt/z-)* where Cg = —KI{2X^ 4- RjL) 

and Cl is a constant of integration. But by (16.31) (2A1 +RIL) = 
Ai — Ag, whence Q = 4 - Cj, and the general solution for 

q is by (16.32) 

q = Qe>^‘t = + C^e^'t . . . (16.35) 

provided that the roots A, and A2 are distinct. This accordingly gives 
us the general solution of the differential equation (16.30). 

If Aj and A2 are real, they must be negative in our case, from the 
formula for the solution of the quadratic equation (16.30). It follows 
that q is the sum of two components each of which decays exponentially, 
without oscillation. 

If however Aj and A 2 are complex they must be conjugate complex 
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quantities (being solutions of a real equation, Section 14.18) and can be 
written — a + r )3 and —a — Since Aj + Ag = —we have 
a = RjzL — a positive quantity. Thus from (16.35) 

q=^ +Ci 

— Q-at ^ C,e^P'] 

= e”®' C 2 (cos — i sin / 3 /) + Cy (cos pt -i- i sin ^/)] 

= (Cg + Cl) cos / 3 f -f (( Cl — f Cg) sin ^t] 

— A cos -r B sin ^t] (sdy). 

The expression in the square brackets represents an oscillation with 
period 

2Tr/j8 = 47rjy(Ai — Ag) = 477 Lj — R^). 

The expression outside the bracket, e*“*' = g-*Ri2L^ shows that this 
oscillation decays in an exponential or geometric manner, and can be 
shortly described as a “ damped oscillation 

Similar conclusions can be expected whenever we have a “ linear 
second-order equation with constant coefficients ” such as (16.29), 
if the interpretation of the quantities concerned is in no way electrical. 
It might also be expected that a similar approach would be helpful in 
the study of the transmission of impulses in nerves; but the situation 
will then be more complicated, since besides the capacity, resistance and 
inductance of the fibre there are also complicated chemical changes to 
be taken into account. 

16.27 Light and radiation 

A comparison of the equations (16.18) for the electrostatic force, 
(16.23) electromagnetic force, and (16.25) force of induc¬ 

tion, shows that in the last equation the force decreases with distance 
as i;r, while in the other two it decreases much more rapidly as the 
inverse square i;;-. Thus at great distances it will still be possible to 
detect the effect of the acceleration of an electric charge, even when the 
etfcct of its velocity or the simple electrostatic effect have become 
negligible. T his is the principle of a wireless transmitter, in which the 
electrons in the aerial are constantly maintained in a state of rapid 
oscillation. Other examples of electromagnetic radiation of this sort are 
light, radiant heat, ultra-violet rays, X-rays and gamma rays. 

By using a more exact form of the equations than the ones we have 
given it can be shown that the radiation, or field produced bv acceler¬ 
ation, is not transmitted instantaneously (as implied by equation 16,25) 
but travels outwards with velocity c in'empty space, and c/a/(A.» in a 
medium of dielectric constant K and permeability It follows that 
is the ratio of the velocity of light in free space to the velocity 
in the medium, and is called the “ refractive index ” n of the medium. 

The principal property of n is the refraction equation sin f/sin r = w, 
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where i is the angle of incidence and r that of refraction. (For proof, see 
any text-book on optics.) When i and rare small this becomes ijr c: n. 
From this it can be shown that if a thin lens is made of material of 
refractive index n and has spherical surfaces of radii of curv'ature 
and i?2 respectively (measured positively if curved away from the ob¬ 
ject), and focal length/, then 



where u metres = the distance of the object, and v metres is the dis¬ 
tance of the image. The reciprocal ijf of the focal length is called the 
“ power ” of the lens, and is measured in dioptres, where i dioptre (D) = 
i/metre. This unit is of importance in expressing the degree of short¬ 
er long-sightedness of an individual; the necessary correcting lens will 
have its power calculated in dioptres. 

The two most important properties of a beam of light or radiation 
are the wavelength, the physical equivalent of colour, and the amplitude, 
the physical equivalent of brightness. The wavelength is usually ex¬ 
pressed in Angstroms; i Angstrom (A) = lO”*® metre. (Frequency = 
velocity/wavelength = cjX.) I'he brightness can be expressed in energy 
units (as watts/m-): but this is often not very suitable as it does not lake 
into account the different sensitivity of the eye to different wavelengths. 
It is usual to take an international candle-power defined in terms of a 
special lamp as the unit of intensity of a source of light. The flux or 
amount of light per second falling on a surface is measured in lumens’, 
I candle-power lamp emits 477 lumens. The intensity of illumination 
of a surface is measured in lux: i lux — i lumen per square metre. At 
a distance d metres from a source of c candle-power the intensity of 
illumination of a surface on which the light falls normally is c/d- lux; 
if it falls at an angle 0 with the normal to the surface, the intensity will 
be (c cos 0 )ld'^ lux. 


x6.z8 Assay units 

Certain substances, such as newly discovered antiseptics, insecti¬ 
cides, and vitamins cannot be estimated by the usual chemical methods, 
since either their chemical structure is not known, or no appropriate 
scheme of quantitative analysis has been worked out. In such a case a 
bio-assay has to be performed and the strength of the unknown sub¬ 
stance estimated by its effect on experimental animals or animal tissue. 
It is then usual to invent an arbitrary unit describing the ratio of the 
potency of the specimen under investigation to that of a standard 
preparation, and in time such an arbitrary unit will become an “ inter¬ 
national unit Such international units may remain in use even when 
the bio-assay procedure has been replaced by a more exact chemical 
assay. The principal examples of international units are the following : 
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Substance 


Provitamin A 
Vitamin Bi 






c 

D 

E 


Penicillin 


LU. 

•0006 mg jS-carotene 
•003 mg pure synthetic 
•05 mg /-ascorbic acid 
•000025 mg pure vitamin D3 
I o mg a-tocopheryl acetate 
•0005988 mg 


Date of standard 
1949 

1938 

1934 

1949 

1941 

1952 


The mathematical and statistical processes used in assay work form 
an important but rather complicated and specialized part of mathe¬ 
matical biology. It would be impossible, for reasons of space, to do jus¬ 
tice to the subject here. The reader who is interested is referred to other 
books on the subject, such as D. J. Finney, Statistical Method in Bio- 
logical Assay (Charles Grilhn & Co., 1952) and J. H. Burn, D. J. Finney 
and L. G. Goodwin, Biological Standardization (O.U.P., 1950). A 
possible simplification of some of the methods has been suggested by 
J. Berkson, “ Application of the logistic function to bio-assay”, Joum. 
Amer. Stat. Assoc., 39 (1944), 375. 



CHAPTER SEVENTEEN 


METHODS OF SOLVING EQUATIONS 


17.1 Cubic equations 

We have already shown (in Section 3.4) how to solve quadratic 
equations. Cubic equations also have a fairly simple general solution, 
and quartic (4th degree) equations a rather complicated one. Here 
only cubics will be considered. 

The general cubic equation can be written in the form 

+ Cx^ + Bx A = o . . ( 17 . 1 ) 

with the coefficient of x^ equal to i. For if .v® has any other coefficient 
we can divide the equation through by this coefficient. 'J’hus the equa¬ 
tion 

^x^ — 2x- + 4.V — 3 = 0 

becomes on division by i.e. multiplication by 3, 

x^ — 6x^ + 12.V — 9 = o . . (17.2) 

while the equation 

2X^ — — 8a■ + 24 = o 

becomes on division by 2 

_ 3,v 2 — 4,r + 12 = o . . (17.3) 

The next obvious step is to try “ completing the cube ”, just as we 
complete the square for a quadratic. Now {x + = x'^ 4 - Ca- 4- 

J 4- iV U®; and this begins with the same a® and a- terms as the 
given equation -v® 4- Cx^ 4- Bx + A ^ o. Thus it is natural to try the 
effect of the substitution x 4 - i C = A', i.e. 

X = X — JC . . (i-y .4) 

On substituting this value for a- in (17.1) the equation becomes 

X^ +{B - k C^)X 4- (^ - i 4- C^) - o 

or, say. 

X^ 4 “ B'X 4- A' = 0 . . (^7-5) 

where B' = B ~ A' = A ~ J/JC + and the term in A- 

has now disappeared. Thus if we put jc == A 4- 2 in equation (17.2) 
it reduces to 


1 1 


A3 — I = o 

481 


(17.6) 
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and if we put .r — -ST + i in equation (17.3) it reduces to 

X^-^X + t^o . . . (17.7) 

Now if the term in X also vanishes, as in (17.6), the solution is simple. 
The equation X^ — i = o is equivalent to X^ — i, i.e. X = 1 (and, if 
we allow complex roots, X — and X ^ co- too). Since X was defined 
by the substitution x X + 2, it follows that the roots of the original 
equation are i + 2 = 3, a»+2 and + 2. 

When B' ^ o the solution of the equation X^ + B'X + .< 4 ' — o is 
obtained from a consideration of the identities 


sinh 3^ = 4 (sinh 0)® + 3 sinh 6 
cosh 3^ = 4 (cosh — 3 cosh 0 
cos 3^ = 4 (cos 6 y — 3 cos 9 

From the first of these three identities we can always find a solution u 
of any equation of the special form 4K® -}- 37^ — K. For let us put 
sinh“‘ u~d\ then u = sinh 6 , + 3M = sinh ^9 = K. If ^ is given 

we can readily find 3 9 from the tables of sinhs; division by 3 gives the 
value of 9, and u = sinh 9 . Similarly if jAT] > i the second identity 
provides a solution of the equation 477® — 377 = K. This solution can 
be written u = cosh (J cosh-^AT). If | ATI < i the third identity gives 
us three real solutions of the equation 477® — 377 = AT. For, on putting 
u = cos 9, we shall have 417® — 377 — cos 3^ = AT, so that 3^ = 
cos-^AT. However if 3 9 ^ is one possible value of cos-*A", then 3 + 27 r 

and 3^1 “ 1 " 4 ^ ^^so possible values, so that on dividing by 3 w’^e 
obtain 8 ^, 9 ^ + 577, and 9 i + In as possible values of 0, and cos 
cos and cos ((?i + stt) as values of 77. It only remains to coax 

the original equation A'® + BX + - 4 ' = o into the soluble form 


41/3 jf jg positive we make the substitution X = uV^B', 

and the equation X^ + B'X A' o reduces to iB’VfB' + 
uB'ViB' + A' = o, i.e. to 477=* + 377 = -- 4 'vX27/4B'3) = K. If B' is 
negative we similarly substitute X = uV—*B\ obtaining 4772 — 377 = 
— A' \/( — 2'j!^B'^) — K. In either case we can solve the resulting 
equation for 77, and from u we obtain X, and from X, x. 

I hus taking (17.7) as an example we must substitute X = 
whereupon the equation becomes 


4773-3„= -6\/(27/4-343) 

^ - V(243/343) = —S417 = K 

If therefore we put 77 = cos 9 , we find cos 3 0 = —8417, whence 3 0 = 
(2i8’68 + 36077) . 1 he possible values of 9 are therefore 70*89°, 

190-89° and 310-89° giving 77 = cos 0 = .3274, —9820, or -6546. 
From this we obtain A = ^7\^(28/3) = 3'05577 = i*ooo2, —3-0000, or 
1-9998 respectively, and so the original equation for .v, 

— 2X- — 4.V 4 - 12 = o 
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has the roots x = X 1 = 2 0002, —2 0000, and 2-9998. Actually the 
roots are 2, —2, and 3 exactly; the slight divergences in the last decimal 
are due to errors of computation. 

17.2 Newton’s method 

There is a very simple and general numerical method which will 
solve almost any equation. 

Let the equation he y = f{x) = o. The first step is to obtain an 
approximate idea of the positions of the roots. This can be done 
graphically, by plotting the curve y = /(x) very roughly. 

Next, a value Xi of x is chosen, not too far from one of the roots. 
Then provided that the function/(v) has a Taylor series it follows that 
f(^) =/(-^i) + ~ - for values of x near x^. If X is the 

root we are seeking,/(.A) = o, so that 

/(jfi) + (JV — 4 - • . • = o . . (17.8) 

Now if the value .v, we chose is sufficiently near the root X all the terms 
in (X — and higher powers will be negligible, and X can be found 
by solving the equation /(v,) + (A — ^,) = o obtained by con¬ 

sidering only the first two terms in (17.8). 'I'his gives x v, —/(.Vj) — 
/x(a*i) i e. the first estimate a:, has to be corrected by subtracting 
f(x\)lfx{xi). Now if Xi is judiciously chosen, the higher powers of 
(X — A,) will be small, in general not entirely negligible: so that 

^2 =- A, -/(x,)/A(a,) . . (17.9) 

will usually be a better approximation to X than x^ is, but not exactly 
equal to X. However we can then repeat the process on Vn, obtaining a 
third and still better approximation x^, and so on: after several steps of 
this kind /(.v„) will usually be found to differ inappreciably from zero, 
and x,^ will accordingly be the required root. 

The advantages of this process are: 

(i) It is very simple. If the form of the function f{x) is known, that 
of the derivative/,( a) can be found by differentiation. It is then as a 
rule a fairly easy matter to find/( a,) and /,(.v,) for any given value .v,, 
and thence to calculate the correction —/(Ai)//^(.v,). 

(ii) It is general: it can be applied to (almost) any function /(.v). 

(iii) It is very rapid. Provided the initial value v, is not too far 
from the root, the sequence of approximations .v„ Aj, .V3, . . . will 

approach the root at least with geometric rapidity, and usually much 
faster. 

(iv) It is self-correcting. If a small error is made at any stage in the 
calculation it will not affect the final result (provided of course that the 
error is not repeated in all further stages). 
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EXAMPLE 

We use this process to solve the equation f{x) = x -{- In a: = o. 
Since/(.v) is —00 when a: = o, and i when a* = i, there must be a root 
between these values. As a convenient starting-point we shall take 
.Vi = I. Also by differentiation fj.{x) — i i/a. The calculations 
therefore proceed as follows: 


Step r 

1 

1 

I i 

! 

2 ! 

, 1 

! 3 

> 

1 

4 

^ » «• •• ** 

I 

■5 1 

! -564 

•56714 

/(.Y,) -= .Yr + In .Y, 

(from tables) .. 

j 1 

1 1 

I 

— 193 

0 

r*^ 

00 

0 

0 

• 

1 

-f- '00002 

/,(av) == I -h i/av 

2 

; 3 

2*773 

2-764 

Correction = 

—5 

1 i 

[ 1 

+ 064 

-{-•00314 

i 

-OOOOI 


In each case the new value .Y;.4., is obtained from by adding the cor¬ 
rection, —/(•Y;.)/j.(.Vr). It will be seen that after 4 steps the correction 
amounts to only i in the fifth place of decimals: thus the required root 
is .V5*56713. 

As we have said, this method applies to any equation: and it is 
equally good for finding real and complex roots. But the special case 
of an /ah degree equation for which f{x) = .v" + . . . + Cx~ + Bx 4 * A 
deserves further attention. For such an equation we know that f{x) can 
be written in the form (.v— a) (.v—j 9 ) (-v—y) . .. (.v— A) where a, / 3 , y . . . A 
are the roots. Suppose therefore we have found one root a: it is then 
advisable to divide/(.v) — .v" -f . . . -tCa- -j- Bx A algebraically by 
A* — a. d’his is easily done according to the method explained in 
Section 3.6, and the quotient /(.v)/(.v — a) = ^(.v) will be equal to 
{x—ft) (.V—r) . . .(.Y —A), where ft, y, , . .X are roots which still remain 
to be found. Now ft is a root of ^^(.v) —/(a*) (.Y — a) = o; and this 
ct|uario!i is usually simpler to solve than the original equation f{z) — o, 
since it is of lower degree. Having found ft by Newton’s method we 
can divide ^(.v) by {x- ft) to give a quotient h{x), and go on to solve 
//(.v) o. In this way all the roots can be found. 

PROBLEMS 

(1) Solve the equation .v- • - z by Newton’s method, starting with 

= I. (I'his exemplifies a very quick method of finding square roots 

on a calculating machine.) 

(2) Solve -v® -- 3 by Newton’s method. 

For a polynomial a great deal of the work needed to calculate /{Xr) 
and f:r{Xr) can be shortened by the following ingenious devices, due to 
P. A. Samuelson. 
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Firstly suppose is real. Divide the original polynomial through 
by (a; — according to the method of Section 3 . 6 ; let the quotient be 
Qi(x) (which is a polynomial) and the remainder /?, (which is a number): 
then 

/(x) = (x — Xr) Qi(x) + 

identically. Now divide ^^(a:) again by {x — jc^), giving quotient Q^ix) 
and remainder so that 

Qi{x) = {x — Xr) Q^ix) + 

and /(^) = (a-— .Vr)= p 2 (jf) 4-/?o(x — .Y^) 4- 

From ( 17 . 10 ) we have by differentiation with respect to x 

f^(x) = 2 {x — x^) Q.i{x) 4- (Jc — x,y Qiix) 4- - (i? * 0 

whence on substituting x^ for x in equations ( 17 . 10 ) and ( 17 . 11 ) we 
obtain 

/(^r)= /?,. fr{Xr)=R„ 

and the correction to be added to x^ is simply — 

Xf-t-l = Xr RijR.t 

If .Vr is complex a great deal of the manipulation with complex quan¬ 
tities can be avoided by the following device. Let x^ be the conjugate 
complex of x,: then we know that 

{x — X,) {x — .v,) = .Y* — {x\ -h x\) 4- 

= x^ — 2 ax 4 - c (say) 

is a real expression. Divide f{x) by {x- — 2 ax 4- c): let the quotient be 
0,(.v) and the remainder R^x 4- R^. Divide (2 ,(y) again through by 
(y 2 — 2 ax 4- c) \ let the quotient be Q.>{x) and the remainder R ^x 4 - 7? 3 . 
Then 

/(a) === (.y 2 — 2 ax 4- c) Qi(x) 4- R^x + 7?, 

= (y 2 —2<7y4-c) 2 p2(A)4-(A:2—20^4-^) (/?4 y4-7?3)4 7?„y4-7?i 
= (y-.v,)’*(y-y,)2(22(a)4-(^-y,)(.v-y,)(7?j.y 4-/?3)4/?2.v-f7?,. 

From this we find by substituting x = x\, 

/(Xf) = R-iXr RI . . ( 17 . 12 ) 

and by differentiation with respect to x and substituting y = .v^, 

/AXr) = {Xr Xf) {R^X'r 4" 7?3) 4“ 7? 2 • . (i7-i3) 

If the coefficients yl, C. . . in the original polynomial /{x) = -f . , , 

4' 4- A are all real this process involves only real numbers 

right up to the point where the value of y, is substituted into the equal 
tions ( 17 . 12 ) and ( 17 . 13 ). We then find our next approximation to the 
root a as x, + i - .V;. /{Xr)l/x{Xr) and repeat the process. When the 
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correction becomes negligible we know that Xf. ~ a, the complex root. 
But since a must also be a root, this process finds two roots in one oper¬ 
ation. The original polynomial can then be divided by the real expres¬ 
sion (x — a) (-^ — a), and its degree reduced by 2 at a single step. The 
process \vill then be repeated (if necessary) until all the roots are foimd. 

17.3 Simultaneous equations for several unknowns 

If we have equations connecting 2 unknowns, say x y — 
xy = 6, the standard procedure is to solve one of the equations for one 
unknown, and then to substitute its value in the remaining equations. 
In this way it is “ eliminated Thus taking the two equations given 
above, x -{- y = 5, == 6 , we solve the first equation for x — $ — y, and 

substitute this value in the second one, giving (5 — y) y = ^. x has 
now been eliminated, and there remains a quadratic equation for y 
with the solutions = 2 andy = 3. Since x = 5 — y the correspond¬ 
ing values of x are 3 and 2; i.e. the solution of the given set of equations 
is either .v = 3, = 2 or a: = 2, y = 3. Similarly if we have the three 

equations 

X s = o 

X y — z — o 
xz — y 

we begin by solving the first equation for x^ obtaining x = —z. Substi¬ 
tution of this value in the other equations gives 

V — 23: = o 



We solve the first of these equations for j’, givingy = 2z; substitution 
in the remaining equation gives — z- = zzy i.e. z = o or — 2. Since 
y = 2Zy .V =- — z, the possible solutions are therefore .v = o, ^ = o, 
z = o and .v = 2, y = —4, z = —2. 

7 his procedure shows that to determine n unknowns we need in 
general n equations; for we need {u — i) equations to eliminate (« — i) 
unknowns, and one further equation to determine the last. Such an 
elimination may call for considerable skill in manipulation. If however 
all the equations are linear it can be systematically performed. 

• 

17.4 Linear simultaneous equations 

The usual method of solving linear simultaneous equations such as 

5.V -\-2y -'r z = y \ 

5.V + 8y 4- 33: = —8 - . . (17.14) 

2X H- 2y — yz =~6 ) 

is explained in text-books on elementary^ algebra. It amounts simply to a 
successive elimination of each of the unknowns, finally ending up with 
an equation containing one unknown only. This method of solution is 
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about as efficient as any, but a great deal of the effort required can be 
saved by using a systematic form of calculation. We shall explain 
several alternative schemes below. 


Scheme i—Elimination of the unknowns x, y, z, in that order 


Stage 

0 = 

5 ^ + 33 ^ + ^ —7 — 

2T 


I 

0 = 

5x- 4- 8y 4- 3-3^ +8 - 

24T 



0 = 

3.V 4 - 2y — yz 4-6 — 




X = 

— •6y — ‘2z 4- 1*4 + 

■4T = 

3-25 

2 

0 = 

S'Oy 4" 2-oxr 4- iS'O — 

220T 



0 = 

■2y — 'j'bz 4- 10-2 — 

2-8r 


3 

y = 

— ' 4 - — 3'0 + 

44^ = 

-3-50 


0 = 

— 7-682 + 9-60 — 

1*92 7 ' 


4 

z ~ 

1-25 — 

■257' = 

1-25 


Explanation 

For the moment ignore the letter 7', which is explained later. 

The first stage is to write the three equations with o on the left-hand 
side, and all other terms on the right. 

Now take the first equation, o = 5A; + 3j -}- xr — 7, and solve it for 
x\ i.e. multiply through by —I (minus the reciprocal of the coefficient 
of x) and take x over to the left-hand side. The resulting equation, 
X ~ — 6y — ■2z 1-4, is the first equation of stage 2. We shall call 

the equation (o = 5X + 3>' + 5? — 7) which is thus solved for .v the 
“ pivotal equation ” of stage i, and the .v-term in it the “ pivotal term 
This term is indicated in the scheme by heavy type. 

X is now expressed in terms of y and z; this relation can be used to 
eliminate x from the remaining two equations of stage i, obtaining the 
remaining two equations of stage 2. Thus the substitution a: = 
— 6y —•2z -f 1*4 the second equation o = 5.V + + 3:? 4- 8 of 
the first stage gives the second equation o = 5 o_y -f- 2'oz H- 15 0 of 
the second stage. This is merely a matter of multiplication and addi¬ 
tion: 8 + 5 X (—6) = 5 0, 3 4- 5 X (—2) = 2 0, 8 4- 5 X 1-4 = 
15 0. Llimination of x from the third equation of stage i gives in the 
same way the third equation of stage 2. 

We now choose the second equation of stage 2, i.e. o = 5 0^ 4- 2 0^ 4- 
15 0, as the “ pivotal ” equation to be solved fory. (The term s oy is 
therefore the pivotal term and is shown in the scheme in heavy type.) 
We obtain y = —4a — 3'0, the first equation of stage 3. Substitution 
of this in the last equation of stage 2 gives the last equation of stage 3, 
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o = —7-685: + 9-60. This will in turn be the pivotal equation to be 
solved f^or Zy giving (in stage 4) a = i -25. We now substitute this in the 
first equation of stage 3, obtaining;)'= —3'5o; a further substitution in 
stage 2 gives x — 3*25, and the equations are solved. The solution 
.V — 3 25, y= -3-50, z= 1-25 is checked by substitution in the 
original equations: thus S-v + 3^ + 5: = 5 x 3-25 + 3 X (—3*50) 
-f 1-25 = 7, as should be. 

But it is useful to have a further check to catch any errors in the 
calculation as they occur. This is the use of the symbol T. In stage i 
we add to each equation an extra term aT : the coefficient a is equal to 
the sum of all the other coefficients in the equation (including the con¬ 
stant term) but with the sign reversed. Thus in the first equation 
5 + 3 + 1 —7 = 2, so the extra term is —aT. The whole elimination 
is then done with this extra check term included, treating T as an ordin¬ 
ary algebraic symbol. In ever\' equation it should then be found that 
the sum of the coefficients on the left-hand side is equal to the sum on 
the right: if not, there is an error. Thus in the first equation of section 
3 we have i = —4 — 3-0 -|- 4-4. 

1 his is the general scheme of solution. However, in special cases 
the work can be cut down by various devices. Thus an inspection of our 
equations shows that it would have been simpler to eliminate z instead 
ot .V, using the first equation, since the coefficient of z is i. When we do 
that the calculation proceeds as follows: 


Scheme 2—Elimination of unknowns in order z, y, x 


Static 


I 

0 


33 ’ z 


7 “ 

- 2 T 



0 = - 

5.V 4- 

Sv 3z 

a. 

8 - 

-mt 



0 

A* i 


1 

—»— 

6 - 

- 47 ^ 


2 

•s* 


33 ’ 

+ 

7 - 

f- zT 

= 1-25 


0 - 

— lO.V — 

y 

1 

29 - 

- 1ST 


0 - 

3.S.Y L 

233’ 


43 - 

~ iST 


3 

y ~ 

— I O.Y 


—- 

29 - 

- iST 

= -3-50 


0 

- I92.V 


-i. 

1 

624 - 

- 4327 ’ 

4 

.V = 




3 25 - 

- 2-25r 

3 25 


Here we are fortunate enough to find a second simple pivotal ele¬ 
ment, y, in the second stage. I he calculations proceed exactly as in 
Scheme i, except for the different order of elimination, which here 
simplifies the arithmetic. 
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17.5 The inversion method 

A method which is sometimes helpful is to solve the equations using 
letters instead of particular numbers on the right-hand side. Thus in 
our example we would solve 

5-* + 3:v + « ) 

5^ + + 35: -- r . . (17.15) 

-j- zy — jz = w ] 

and then put u = 'j,v = —8, w — —6 to obtain the final answer. The 
calculation proceeds exactly as before: we begin by moving all the terms 
to the right-hand side. 


Scheme 3—Inversion of equations 


0 0 0 

II II II 

5 x + 
5 x 4 - 
3 ^ + 

3 y + 
Sy 4 - 
23; - 

z 

3 ^ 

yz 

— u 

— V 

— to 

sr 

'ST’ . 
3T 

z == 

- 5 -v - 

3 y 


4 - u 


4 - 

8 T 

0 = 

— 1 o.v — 

y 


+ 3 « 

— V 

+ 

9T 

0 = 

38a; 4 - 23y 


— 7 » 


— w — 

S 3 T 

y = 

— I o.v 



4 - 3 u 

— V 

4 - 

9 T 

0 = 

— 192X 



4-62M 

— 231; 

— to 4-i54r 


whence on solving for x and substituting back 
X = (62W — 23«? — w)jl<)z 
y = ("44H + + iozf)/i 92 

Z = (14M V — 2 $iv)fl<) 2 . 


It is easy to check that these values do satisfy {17.15). In particular, 
when u — 'j,v = —8, w ~ —6 we find x = 3*25,= -3-50, z = 1*25 
as before. 

17.6 Linear equations with integral coefficients 

In Scheme i above we began by dividing the first equation by —5, 
the coefficient of the pivot x with reversed sign. This would have been 
inconvenient if the coefficient was 6 or 3 instead of 5, since it would 
have introduced an unending decimal. So when the coefficients in the 
equations are whole numbers it is useful to modify the scheme to 
replace division of the pivotal equation by multiplication of the non- 
pivotal ones by suitable factors. Thus to solve the equations 

3 * + 3 >' - 7 -® = 4 

(ix iiy ~ jz = —9 

yx 4 - ^y Jf. (^z = —8 
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we proceed as follows: 


Scheme 4—Solution of equations with integral coefficients 


% 

Stage 

Multiplier 

Equations 

I 

3 X : 

1 

0= 3X + 2y — 7^— 4+ 5^ 

0 = (ix iiy — yar 4- 9 — iqT 

0 = ']x 33' -f 9^ 4 - 8 — z']T 

la 


3 ^= — 3 jy 4- 7 ^ 4- 4 _ 5 ^ 

0 = 6a: 4 - ii3» — jz 9 — i9T 

0 = 21a: 4-93/4- 27a; 4- 24 — 81T 

2 

f X 

0 = S>' + 7 - 2 ^ + 17 — 29T 

0 = — 123' 4- ybar 4- 52 — nbT 

2a 

1 

1 

1 

\ 5 y= — 72r — 17 4- 29r 

0 = — 153' + 95 - + 65 — I 45 r 

3 


0 = ii6z 4-116 — 232?' 

4 


« = — I 4 - 


whence on back substitution 


^ y — —2 (from stage 2a), .v = i (from stage la). 

Exphmation 

1 lie term 3,r in the first equations is chosen as pivotal element. To 
use this \vc have to bring all the coefficients of .v to be multiples of 3: 
that means that the third equation must be multiplied through by 3. 
In stage I a we have taken the 3.V to the left-hand side, in preparation for 
elimination, and \ve have multiplied the third equation by 3. It is now 
e,i.s\ to eliminate .v: we find the value of 6.v for the second equation in 
stage la by multiplying 3.Y by 2, and the value of 21.V for the third 
equation by multiplying 3.V by 7. Substitution of these values gives 
stage 2, ^\here .v has been eliminated. now choose 53* as our second 
pivotal element. The second equation in stage 2 must therefore be 
multiplied by 5, to make its coefficient a multiple of 5. But we also 
notice that it has a common factor 4, which can conveniently be taken 
out by division by 4: i.e. we choose 5/4 as our multiplying factor for 
this equation, obtaining stage 2a. It is now easy to eliminate jy, since 
tsy in the second equation can be obtained by multiplying the value 
of Sy io the first by 3. This gives stage 3, which is solved to give 
3 = I. Substitution of this in the first equations of stages za and la 
respectively gives the values y = —2, .v = i. 
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We make a note of the multiplying factors we have used for the non^ 
pivotal equations; these are shown in the “multiplier” column and 
will be required later. 


PROBLEMS 

Solve the following equations by various methods. 


(0 



2 X 

4- 

3 y = 

3 




5 X 

i 

yy ^ 

25 

(2) 

x 

+ 

y 

4- 


6 


X 

4 - 

3 y 

+ 

2 Z = 

11 


4.V 

+ 

y 

■■ «■ 

SS = 

9 

(3) 

5^ 

4- 

3 y 

4- 

z = 

2 


5^ 

+ 

8 y 


3^ = 

3 


2 X 

— 

y 

4 - 

6z = 

— 3 


17.7 Solution by successive approximation 
The equations (17.14) may be written 

Sx + ^y -\- z — y = 0 ] 

5Jc: + Sjy + 33: -b 8 = o V . . (17.16) 

2y — yar + 6 = o J 

Another method of solving these equations is to start off with trial 
values of .v, y, and z, and then gradually modify them to make the ex¬ 
pressions $x-\-$y-\-z — y=^U (say), 5.V + + 33r — 8 = K, and 

^x + 2y — 7a: 6 = W approach more and more nearly to o. (This 

is a special case of the “ relaxation method for further details, see 
L. Fox, y. Roy. Statist. Soc., B, iz (1950), 131.) 


Scheme 5—Successive approximation 


X 

y 

z 

0 

0 

0 

1 

— li 

1 

1 

2 

1 

1 

1 

1 

—2 

I 

■3 

-36 

- 1 

<^= 5 ^ 4 - 3 >’ 4-^—7 ! 

H 



I 

—5 

0 

■3 

— •78 

—88 

T= 5 a: 4 - 8 jv 4-324-8 



10 1 

J 3 

— 3 

2 

1 

2-0 

•02 

-•28 

3 ^ 4 -zy—yz 4 -6 

H 



3 

- I 

1 

2 1 

-1 

-82 

—12 

r=i 3 * 4 -i 3 >'—324-7 

7 

~6 

1 

20 

17 

1 

1 

-9 

4 

3*1 1 

; 1 

-158 

— 1-28 
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X 

3 

■18 



•03 


y 

-336 


—08 

1 

1 

-03 

S 

1-2 , 

1 

1 

1 


•04 

1 


U=$x-\-^y-\- z—y 

—88 

•02 

—22 

1 

— 18 

—03 

-12 

V=5x-\-Sy-\-2z-\-S 

—28 

•62 

—02 ! 

•10 

1 ^ 
•25 

■01 

IF= 3 .V + 2>'—7 jsr-b 6 

— 12 

•42 

■26 

— 02 

•07 

•01 

r=i 3 .v-l-i 3 y— 3 ar -|-7 

• 1 

— 1-28 

' i 

4 

I 06 

•02 

1 1 

-10 

•29 

-10 


Totals; a: = 3 235, y =— 3-45, ar = 1-24. 


025 



—10 *225 


Explanation 

T is here the sum XJ V and is provided as a check: at any 

stage in the computation the number T in the last row should be the 
sum of the three numbers JJ^ \\ and W above it. 

We start by giving .v, and z any suitable values. Here we have 
chosen .v = o, = o, z — as shown in the second column of this 
scheme: the corresponding values of IJ, V, and Ware —7, 8, and 6 
respectively, as compared with the desired values o, o and o. The big¬ 
gest discrepancy is in the value of K; we therefore try to reduce V by 8 
or thereabouts. Now since the largest coefficient in the expression for 
p _ ^ + 3« -f 8 is that of 3’, the most economical way of reduc¬ 

ing i to zero is to add —i to y. This correction toj^’ is marked opposite 
in the third column: clearly it adds —3 to U, —8 to V, —2 to W and 
— 13 to T, so that (J, V, If’’and T now take the values —10, o, 4 and 
6 respectively. The greatest discrepancy is now in f/, and can be 
corrected by adding 2 to x. This correction is shown in the fourth 
column: it adds 10 to V, 10 to F, 6 to IF, and 26 to T, giving the 
respecti\e values o, 10, 10, and 20. We now correct W: it is not 
necessary' to bring it exactly to zero, since in any case the next step is 
likely to make it non-zero again: so it is good enough to add i to z. 
.After eight corrections of this kind U is reduced to —88, V to —28 
and IF to —12: the values of .v. v, and ^ are found by adding all the 
corrections, i.e. .y — o -|- 2 -r i = 3, y = o — i — 2 — *36 = —3*36> 
xr ~ o + I 3 I r-z 12. These values are transferred to the sec¬ 
ond part of the table and the process continued. After six more oper¬ 
ations \\e have the values .v —~ 3 ~b 't8 -f- -03 -F ‘025 ^ 3*235> y ~ 
3 ' 45 ' ~ t‘24, while U, I , and IF have been reduced to ’005, *135 

and *085 respecti\ cly. It is clear that we are approaching the roots 
.V = 3*25,^’ = 3 5, z — 1-25 which we have already found otherwise: 

but the approach is rather slow. In fact for 3, 4 or 5 equations in an 
equal number of unknowns the direct method of elimination is quicker 
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and more satisfactory. But when a large number of simultaneous equa¬ 
tions have to be solved the process of successive approximation is usually 
the only practicable one: the labour involved in the elimination method 
is prohibitive. 

17.8 Inconsistency and redundancy 

The methods we have described will as a rule give a unique solution 
to n equations in n unknowns. But occasionally they may fail: and it is 
rather important to consider how and why. There can be no values of 
X and y satisfying the two equations x + _y = i, 2 x -}- zy = 3 simul¬ 
taneously. For the first equation becomes 2x + zy = 2 on multipli¬ 
cation by 2: and this is a direct contradiction to the second. On the 
other hand the pair of equations x* -}- ^ == i and 2.v + zjy — 2 has an 
infinite number of solutions. For the second equation is effectively a 
repetition of the first, and any pair of values for which .r + y = i will 
satisfy both equations. There are an infinite number of such pairs; we 
can take x to have any value we wish, and then y will be i — .v. 

This is easily interpreted geometrically. Any linear relation as 
2x — y = I can be considered as the equation of a straight line, L, 
treating .v and y as co-ordinates. 'I'his means that if .v and y are numbers 
such that 2x ~ y = 1, then the point P with co-ordinates (.v, y) lies on 
L. Conversely, if P lies on L, 2x — y ~ o. Now a number of such 
lines are plotted in Fig. 3.2: in particular we have 

(<j) X — 2y - • o, i.e. y = A.v 

(b) X — 2y = —4, i.e. y = 2 + 

(c) 2 x — y = I, i.e. y = I 4 - 2x 

{d) -^x — 6y = —12, i.e. j 2 4- i.v 

(«-) 4^- 4 2y = 3, i.e. y =--• 1-5 — zv. 

Now in general any two lines will meet in one point and one point 
only: and the co-ordinates of this point must then satisfy both equa¬ 
tions simultaneously. But a pair of equations such as {a) and (b) is 
evidently inconsistent, i.e. has no solution: from Fig. 3.2 we see that 
they represent a pair of parallel lines, which have no intersection. On 
the other hand (b) and (d) are equivalent equations, and geometrically 
are represented by the same line: so that any point on the line satisfies 
both equations simultaneously. The pair of equations is then said to 
have “ redundancy ”. 

Similarly an equation ax 4 - by cz (/connecting three unknowns 
can be considered as the equation of a plane in space. In general three 
such planes will meet in a single point. For two of them intersect in a 
line, and this line meets the third line in a point. However, if any two 
of the planes are parallel, but do not coincide, then there can be no point 
of intersection. Likewise if any two intersect in a line parallel to the 
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third plane there will be no intersection of all three, and the equations 
are inconsistent. If all three planes intersect in a line (e.g. any three 
planes passing through the north and south poles) all points on this line 
must satisfy all three equations; and likewise if the three planes coin¬ 
cide there will be an infinite number of solutions. It seems therefore 
that for any system of linear simultaneous equations there are three 
possibilities: either there will be just one solution (which is the usual 
case), or there will be no solutions, or else there will be an infinite num¬ 
ber. But it is impossible for there to be (for instance) three solutions 
and no others. 

We can prove this in general as follow's. We first observe that the 
process of elimination is a reversible one: if we take the tw'o equations 

2 x -h 3 y — 7 = 0 

4 ^+ 9 J —17 = 0 - • • (17-17) 

we can eliminate x from the second by subtracting twice the first 
equation: 

(4.V +9y - 17) - 2(2x +3y -7) = 3y — 3 

and so we obtain the new pair of equations 

2.V + 3 >’ — 7 = o 

3 y — 3 = ° • • • ■ (17-^8) 

It is clearly true that if x and y satisfy the equations (17.17), they must 
also satisfy (17.18). Conversely if (.\r, y) is any solution of (17.18) we 
can show that it must also satisfy (17.17). For the first equation of 
(17.17) is identical with that of (17.18), and by adding twice the first 
equation (17.18) to the second, 2(2.v -f 3^ — 7) + (sy — 3), we repro¬ 
duce the second equation 4.V + qy — 17 = o of (17.17). In brief, the 
two sets of equations are equivalent. 

The same holds for the solution of three equations in three varia- 
ables. We can perhaps see this most clearly in the following way. Con¬ 
sider the equations (17.14) once again: 

+ 3 :^' + = 7 

5.V -f- 8y -h 3^ = —8 

3 a: + 2y — 7 ^ = —6 . . . (17.14) 

We can eliminate x from the second equation by subtracting the first; 
we can eliminate .v from the third by multiplying the first by f and sub¬ 
tracting it. This gives the new system of equations 

5.V + 3y + ar = 7 

5-oy + 2‘oz — —15-0 

‘2y — ’j-6z = —10-2 . . (17.19) 

I'hese steps are reversible: if w'e add the first two equations of (17.19) 
we reproduce the second equation of (17.14). Thus (17.14) and (17.19) 
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are equivalent in the sense that any solution of one must be a solution of 

theother. We can also interpret the new system (17.19) in this way: the 

first equation is one which determines when y and z are given; and 
the other equations relate to y and a: only. We can now proceed one 
step further, multiplying the second equation by •2/5 0 and subtracting 
from the third to eliminate y; we obtain 


5 ^ + = 7 

5-oy + Z'QZ = -15-0 

— 7-863r = — 9 60 . . (17.20) 

This again is a reversible operation, so that (17.20) is equivalent to 
(17.19), and therefore to the original equations. Notice further that the 
leading terms 5^, 5 oy, and —7-86a' in equations (17.20) are identical 
with the pivotal elements printed in hea\y type in Scheme i of Section 
17.4. (For this method of elimination is effectively identical with that 
used in Scheme i, although set out in a different way.) 

Now the first equation of (17.20) determines .v in terms of y and 
the second determines y in terms of ;r; and the third fixes the value of xr! 
This shows that .v, y, and z are completely and uniquely determined by 
the equations. 

We can apply this line of argument to any system of linear equations 
in any number of unknowns, say .v, y, ar . . . w. Suppose the first equa¬ 
tion contains the unknown .v (with a non-zero coefficient); then it can 
be expressed in the form x = ^y + fix? + . ..+//«,+ A', and so 
used to eliminate a; from all the other equations. If one of the remaining 
equations contains, say, the unknown y, it can be written as y = B'z -|- 
• • . + H'w -f K\ and soy can be eliminated from the remaining equa¬ 
tions, and so on. If n equations are given, connecting the n unknowns, 
it is usually possible to eliminate each of the variables in turn in the 
usual way, and that will give us a unique solution. Sometimes, however, 
the process comes to a premature conclusion: e.g. if we take the equa- 

tions * + J ^ 2.V + 2y + 2Z = 2, 3.V + 33, + 3- = 3. and ^se 

the nrst to eliminate x from the others, they then become 

X -\- y z = 1 
0 = 0 
0 = 0 


and we can go no further. The first equation gives the value of ;c in 
terms of y and ar; the second and third place no restriction on y or a 
So the conclusion is that y and a can be given any values we wish and .v 
IS then determined. But if we consider instead the equations x + v 

if — \ and 3a: + 3^ 4_ 3^ = 20, and used the 

nrst one to eliminate x from the other two, we obtain 


-f-y + a = I 
0 = 0 
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and this is a contradiction: i.e. we were wrong in assuming that the 
original equations had a solution. 

Now the process of elimination can only come to a premature halt 
when all the remaining unknowns have disappeared from all the re¬ 
maining equations. (Otherwise one of the equations could be used to 
eliminate still one more unknown.) Thus each of these equations must 
have been reduced to one of the forms o = o or o =; /f (with k different 
from o). The second form is a contradiction, and shows that the equa¬ 
tions have no solution. If on the other hand all the remaining equations 
have been reduced to o = o, they convey no information about the un¬ 
knowns not so far eliminated, and so those unknowns are free to take 
any arbitrary values. This establishes our proposition that a system of 
linear equations can have either no solution, or one solution, or else 
an infinity of solutions. In the last case we can choose arbitrary values 
for a certain number of the unknowns, and the rest will then be fixed. 
We can always find out which of these three cases holds in any parti¬ 
cular case by actually solving the equations. But there is also an inter¬ 
esting theoretical test, which we now go on to describe. 

PROBLEMS 

Which of the following systems of equations are (a) uniquely solv¬ 
able, (A) inconsistent, and (c) redundant? 

(1) .V -r y + = i, 2.v + y -h S« = 6, .v + 3y -f 22: = 20 

(2) a: + 3y -\- z = i, 2.v — y + 3^? = —2, .v ~ i ly + 3ir = 8 

(3) — 3 y +22= I, 2.V + gy + 9- = 22, 2 + 3y + 4^ = 9 

(4) 2.V 4- 3_y — 5~ = 6, 3.\' + 2y — 42 = 9, A' — y + = 11. 

is ) -v - y - 2 = I, 2.V -hy - 2 = 3, .V -f 2y = 4 

17.9 Determinants 

First consider two equations a^x -I- 6,y = hi, ~ 

the two unknowns, .v and v. These equations can equally well be 
written y — rt,.Y 6, -f hubi^y — —a^x bi + provided that both 

bi and h., are ditferent in value from zero. Thev therefore represent 
two straight lines, with slopes —and —a^ b^ respectively. The 
original equations will have a unique solution when these lines have a 
single point of intersection, and that will be so unless they happen to 
be parallel, that is, of equal slope, in which case —= —ajb^’ If 
we multiply this equation through by ~bJ)o it becomes aj}^ — 
i.e. a^b i — a Jby o. So the equations fl,.v -f 6,y =//j, a^-yb^y 

//., will have one and only one solution if the quantity Oib^ — is 
different from zero; whereas if a^b.^ — =- o, they will have either 

no solution or an infinite number. In this demonstration w'e have 
assumed that b^ and b^ are both different from zero. But (as the reader 
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can easily verify) a careful consideration of the cases when either 61, or 
b2, or both, are zero shows that the conclusion is still true then. 

A similar investigation of the set of three equations in three un¬ 
knowns, 

-t- b^y + c^z = /i, 
b^y + c 2-3' = // 2 

a^x b^y c^z ^ . . . (17.21) 

shows that they also have one and only one solution, provided that the 
quantity 

d Q^b 2^2 ^ ^b j —h ^ 3^ 1^2 a^b gCt, a^b 2 q j . • 

is different from zero. If J = o they have either no solution, or an 
infinite number. The expressions — <^2^1) A, which deter¬ 

mine whether the equations have a single unique solution, are known as 
“determinants”. We could go on to define determinants of higher 
order in this way. But it turns out to be simpler to approach the prob¬ 
lem indirectly. We begin with a quite different definition of a deter¬ 
minant, and study the consequences of this definition. We return later 
to the problem of the solution of simultaneous equations, and show how 
determinants can be applied to it. 

PROBLEM 

(i) Use formula (17.22) to determine which of the sets of equations 
in the Problems of the previous section are uniquely solvable. 


We shall illustrate the definition of a determinant by first considering 
one of the third order. Suppose we have any array of — 9 numbers 
(or elements) set out in a square, which can be written systematically as 
follows: 

<2, b^ Cy 

Q 2 ^2 ^2 
o 3 b ^ £“3 

We now form all possible products of three numbers chosen from this 
array, with the proviso that no two of the numbers may come from the 
same row or the same column. Thus a would be one possible choice : 
this is known as the “principal diagonal” of the array (sloping down¬ 
wards from left to right), a^b^c^ would also be a possible choice. Hut 
would not be allowed, because and b. are in the same row 
and would not be allowed, since <7, and are in the same col¬ 

umn. We can also state the rule in the form: each of the letters a h 
and r must occur once in the product, and each of the suffixes must 
occur once. Alternatively we can regard it as the problem of placing 

KK 
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three rooks on a chessboard so that no two are in a position to attack 
one another. 

We now give a sign to each of these products according to the 
following rule. Take any product, such as aJ)2Cx. From each of the 
elements a 2, 63, occurring in the product draw a line horizontally to 
the left, and another line vertically upwards (Fig. 17.1). Count the 



b. 

Cl 


b, 

1 

1 

Cl 

• 

d. 

as 

\ bz 


«2 

bz 

Cz 1 

1 


«3 

^3 

Cz 

as 

■ bz 







b. 

C4 

d. 


Fig. 17.1—Siga of a term in a determinant 


number of intersections of these lines; if it is even, the term has a 
(+) sign, and if odd, a (—). Thus for the term ajbz^x there are two 
intersections, and the sign is +. Proceeding in this way we find that 
there are three terms with a + sign, namely aitd ajhiC^y 

and three terms with a — sign, namely —afizCzy — ajiiC^y — 

We now add these terms: the sum 

is the determinant. It is therefore a single number calculated from the 
9 given numbers a,, 6,, etc.; and it is symbolized by writing a vertical 
stroke on each side of the array 

a-x by Cy 

d = (12 b 2 c 2 

Qz b z C z 

(This equation is read “delta equals determinant byt c,, fl2» 

bzy Cz"’ This use of a pair of vertical strokes should not be confused 
with the similar symbol |.vl meaning the absolute value of .r; there is no 
connection between them, except in appearance. But there is no danger 
of confusion, since the absolute value always relates to a single symbol, 
and the determinant almost always to an array of symbols.) 

We can go on to define determinants of higher order. Thus a fourth- 
order determinant 
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will be calculated from an array of i6 numbers arranged in a square. 
From this array we shall select every possible product of 4 elements of 
which no two occur in the same row or the same column (or equiva¬ 
lently, all products such as which contain all 4 letters and all 

4 suffixes). To each such product a sign + or — is assigned by drawing 
lines to the .left and upwards from each element occurring in the pro¬ 
duct, and counting the intersections. Thus Fig. 17.1 shows that there 
are three such intersections obtained from the product a this 

is an odd number, so the product has a — sign. Finally all the terms so 
obtained, with the correct sign attached, are added together. In the 
general fourth-order determinant it will be found that there are 12 
positive terms and 12 negative ones, and the number rises rapidly as 
the order goes up. 

There are simple rules for evaluating determinants of the second and 
third orders (Fig. 17.2). A second-order determinant = 
i.e. the product of the elements in the principal diagonal rninus the 







Fig. 17.2—Second- and third-order determinants 


product of the other two elements. For a third-order determinant we 
imagine the first two columns repeated on the right. The three positive 
terms arc then given by the products of the three downward sloping 
diagonals, and the three negative terms by the 

three upward diagonals, ^3^20,, c^a^h^. These special rules do 

not extend to higher order determinants, and other methods have to be 
used to evaluate them in practice. 


EXAMI’LES 


(I) 


( 2 ) 


( 3 ) 


3 4 i 

1 2 

2 4 


I 

3 

4 


1.4 — 2.3 = —2 


= 1.4 — 2.2 = o 


2 I 
o 2 

2 3 


1.0.3 i 2.2.4 T *-3-2 — 4.0.1 — 2.2.1 — 


3 - 3-2 


o -I 16 + 6 - o — 4 — 18 - o. 
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FURTHER PROBLEMS 


(2) Find i 

2 4 


I 2 i 

I —2 


0 3 


I 3 t. 

I —I 


( 3 ) Find 


I O I I 

O I o 

I 2 I 


1 2 3 

3 J 2 

2 3 I 


17.10 Multiplication of a determinant by a number 

We now consider what properties of determinants can be deduced 
from their definition. The first is that if we multiply any column of a 
determinant throughout by a fixed number k, the value of the determin¬ 
ant is also multiplied by ky i.e., 


kai 


! 

1 

kb^ 

<^1 


Ox 

kx 

1 ka 0 

bo 

C2 1 = 

ao 

kbo 

Co 

= k\^ 

ao 

bo Co 

i 

*0 

^ 3 : 

03 

kb 2 


1 

1 0 3 

b 3 c 2 


(17-23) 


For the determinant 


«i 

a ^ ^ Co 

<2 3 b 2 c 2 


is the sum of products of the form 


i containing one of the numbers <2,, Ooy and a3. When <3j, 

and are replaced by ka^y kao, and ka^ respectively the product be¬ 
comes i kOf Cfy with the same sign as before. Thus each term is multi¬ 
plied by ky and the whole determinant is therefore multiplied by k. 


Example 


3 


1 

1 ' 

2 i 

i 3 

2 ; ; 


1 SSSm 

4 : = l 

i 3 

4 ! 

9 


I 

3 




as can be checked by direct evaluation. 


A similar proof shows that if all the elements in any one row are 
multiplied by k the whole determinant is multiplied by k. It follows 
(l>y taking h = o) that if all the elements in any one row or in any one col¬ 
umn of a determinant are zero, the determinant itself is zero. 


17.11 Addition of determinants 

There is no simple and general rule for the addition of two arbitrar)’ 


determinants. But two determinants such as 


Ox 

Oo 


02 b 


b\ ^*1 I 

^2 f2 • 2nd 
3 ^3 



AI b^ Cj I 

A 2 bo Co which are alike in all columns but one can be 

A3 b 3 c 3 
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added by adding the columns which are different in the two arrays, 
leaving the others as they are: 

(<2i + yi,) by Cy 

^1+^2= (<^2+^2) ^2 ^2 . . (17.24) 

1^3 ^3) 63 ^ 3 ' 

For each term in zl , will be of the form ± ajy.cy, and each term of J , 
will be d: vith the same sign. On adding these terms we obtain 

±{ar + A;)b^t, which is the typical term of the determinant on the 
right-hand side of equation (17.24). 

A similar rule applies to the subtraction of determinants which differ 

in only one column, or to the addition and subtraction of determinants 
which differ in only one row. 


E.XAMPLES 


(0 

1 3 

2 4 

+ 

2 3 

1 4 


3 3 

3 4 

(2) 

I 7 

5 9 ' 


I 2 

5 3 

j 

! I 5 

5 6 

(3) 

' 2 4 

0 7 

+ 

2 4 

3-2 

f 

2 4 ! 

3 5 1 


(by addition of first columns) 

(by subtraction of second 
columns) 

(by addition of last rows) 


17.12 The epsilon symbol 

The definition of a determinant can be written in a very compact 
form by using a special “ epsilon symbol”. We shall define the third- 
order symbol c,,, as an example. This has three suffixes, r, j and t each 
of which is allowed to take the values i, 2, and 3; i.e. we can write 
^ 123 > Ot" ^323, but not € j3i. 

If^ny tw^of the suffixes are equal then e,,, is defined to be o; e.g. 
^221 — ^111 ~ ^313 = 0 . 

If all the suffixes are unequal, then is defined to be i if the 
product aj,.c, is to be given the sign + in the third-order determinant 

definition)" ^'^en the sign It follows that (by 

^ ^ “»Vr • . . (17.25) 


summed over all possible values of a, fi, and y, i.e. a = i 2 or t 

° u’ ^ ^ more than a concrete 

symbol for the sign -}- or - to be attached to the term aJ, r, It is 

given the value o when tw^o suffixes are equal because products like 

with two elements in the same row, do not occur in the deter 
minant. 


We can, in a similar way, define epsilon symbols with any number of 
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suffixes. The fourth-order determinant will be written aabfields, 

where a, jS, y and 8 can take any of the values i, 2, 3, 4. In the case of 
the second order there will be only 4 epsilons: cn = = o> «i2 ~ 

^21 = I. 

The rules for multiplication of a determinant by a constant, and the 
addition of two determinants can then be written very concisely as 

^ = k 2 ^a^yCtob 

^ ^ ^ab^Cy = E €a^y (^a + Ag) b^Cy 

and so become evident consequences of the properties of the summation 
sign 2 “ (Sections 11.5 and 13.14). 


17.13 Transposition 


If we interchange rows and columns in a determinant, i.e. reflect it 
in the principal diagonal, the value of the determinant remains unaltered. 
In symbols, 




^1 



ao 

^3 

! «2 


Co 

w 


*1 

bz 

bz 

i a 3 

*3 

^3 

1 

Cl 

Cz 

c 3 


(17.26) 


This operation is known as “transposition”. For each term of the deter¬ 
minant is a product of three elements, no two of which occur in the 
same row or column. It follows that any product such as ajj^Cx which 
occurs in the determinant must also occur in the transposed deter¬ 
minant. It remains only to show that the signs of the terms are not 
altered by transposition, i.e. by reflection in the principal diagonal. A 
little thought will show that the horizontal and vertical lines used in the 
construction for determining the sign are merely reflected in the prin¬ 
cipal diagonal. The number of their intersections is therefore unchanged, 
and so the sign remains the same. 


17.14 Interchange of columns 

If we interchange any two columns in a determinant we change its 
sign. 



hi 

1 1 

b. 


Cl 

' 




hz 


b. 

ao 


= — ! 

bo 

Qo 

m 

"3 

3 

Cz 

b\ 

«3 

^3 i 

; C 3 

*3 

«3 


(17-27) 


Proof. Each term which occurs in the original determinant, defined as 
the product of three elements no two of which are in the same row or 
same column, will also occur in the determinant with two columns inter¬ 
changed. But its sign may be altered. As an example of the general 
argument let us take a sixth-order determinant with typical term 
±arb,Ctd^eJ,,. 
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d^ 
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Fig* * 7 * 3 —Interchange of columns (c and e) in a determinant 


Now imagine the columns c and e interchanged: then the element r, 
will move over into the position formerly occupied by e,, and the ele¬ 
ment e„ will move into the position formerly occupied by (Fig. 17.3). 
For the purposes of argument we shall consider the case in which c, is 
above (i.e. t<v): the case in which is below can be similarly 
dealt with. Notice that all the elements a,, b„ in the product 

will remain fixed, except for c, and e„. 

We now perform the constructions to find the sign of the term 
^vfto before and after the interchange. AH the construction 
lines will be unaltered except the horizontal and vertical lines from c, 
and which will now run from the new positions, i.e. from the former 
positions of e, and respectively. Fig. 17.3 shows that the effect of this 
IS to replace the parts of the lines joining to and to e, by new lines 
(shown broken in the figure) joining the old positions of to c, and c, to 
e,: all other lines are unaltered. What is the effect of this on the num¬ 
ber of intersections? There are three possible cases to consider; 


(i) Intersections of lines from c, with those from e^.. Here we see 
that there was no intersection in the original position, but there is one 
intersection in the new position (two lines cross at the former position 
of Ct). This alters the number of intersections by i. 

(ii) Intersections between lines drawn from a„ and i.e. the 

elements which have not been moved. These intersections are unal¬ 
tered. 


(m) Intersections between lines drawn horizontally and vertically 
from one of the elements A., d^ and/,., and those drawn horizontally 
and vertically from the elements c, and Let us call the pair of hori- 
iwntal and vertical lines from the unmoved element (a„ b,, d^ or/,) the 
ines L. Then the intersection number will be changed only if the L 
lines intersect the sides of the rectangle for all the other parts 

of the figure are unchanged. Now any intersections of L with the un- 
broken lines will count towards the sign of the term in the old 

position, and any intersections with the broken lines c.c,. c,e, will count 
in the new position. But if the lines L intersect the rectangle c,e,e..c. at 
all they must intersect it twice, once entering and once leaving These 
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two intersections may count either both to the old position, or one to 
the old and one to the new, or both to the new position: this means a 
change of —2, o, or +2 respectively in the total number of intersections. 
This will be true for every such pair of lines L, and so the total change 
from this source will be an even number. 

Adding together the cases (i), (ii) and (iii) we see that the change in 
the number of intersections is i -}- an even number, i.e. an odd number. 
This means that the interchange of columns has changed the sign of the 
term which was chosen as a typical term. It follows that 

every term suffers a change of sign, and therefore so does the whole 
determinant. 

Expressed in terms of epsilon symbols this means that the inter¬ 
change of two suffixes changes the sign of the symbol: e.g. 

^ 321 ” ^ 231 = —^ 123 ~ — 

since ejgs is i. 

It can be similarly shown that an interchange of two rows also 
changes the sign of the determinant. This also can be deduced from 
the theorem that transposition, i.e. the turning of columns into rows, 
does not change the determinant. The theorem concerning interchange 
of columns then becomes one concerning interchange of rows. 


17.15 Consequences of the column interchange theorem 

From the above theorem we see at once that if a determinant has 
two equal columns it is zero. 



^3 ^3 



(17.28) 


For if we interchange these columns we change J into —J. But we also 
leave J unchanged. So d = —d, i.e. d = o. (Similarly if two rows are 
equal.) 


It follows that we can add to any one column any multiple of any 
other column without altering the value of the determinant; and the 
same is true for rows. For by the rules for addition and multiplication, 
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'rhis rule is very helpful in manipulating determinants. 


• (17-29) 
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EXAMPLE 

12 2 

(i) Evaluate o 2 2 = /I. Subtract the second column from 

303 

120 

the third; this gives 020: now subtract the second row from 

303 

100 

the first, to get 020: finally subtract three times the first row 

303 

I o o j 

from the third, obtaining 020;. None of these operations 

0031 

change the value of the determinant. But the final determinant consists 
of a single term only, iX 2 X 3 = 6 =d. 

PROBLEMS 

103 III 

(1) Evaluate o —2 4 i to cy* 

2 2 2 I I cu^ uj 

I A*- 

(2) Show that the determinant i y is exactly divisible by 

I z 

(x — y) and therefore also by (y — z) and (z — x). 

17.16 Determinants applied to simultaneous equations 
Consider the set of equations 

+ b,y + CjZ = A, \ 

+ b 2 y + I ■ . (17-30) 

-^b^y h^j 

in the unknowns x^y, and z. 

Let us solve these by elimination. We begin by eliminating one of 
the variables: let us say for the sake of argument that the first equation 
IS used to eliminate x, so that the pivotal element is a,x. This means 

that certain multiples of the first equation are added to the other two 
so that the equations become 

“iX + b,y + c^z = ht 
biy+c^z = h^ 

*3'3'+C3's = *3' 
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where b^y etc. are the new coefficients after elimination of x. Now 
by the theorems of the preceding section this addition of multiples of 
the first row on to the others does not affect the determinant of the 
coefficients on the left-hand side; so that 


«1 


Cl fli 


Cl 

^2 


C2 = 0 

i,' 

cf 

«3 

h. 

C3 0 

63' 

cf . 


We now proceed to choose a further pivotal element in the last two equa¬ 
tions; say ^2 using the second equation to eliminate y from the last. 
The equations now reduce to 

diX -h c^z = h 

bi’y+c.’z^h^ 

r " - h " 

63 A# - 

and the determinant of the coefficients is unaltered by this process, so 
that 

d = o 62' fo' ■ • • (17-31) 

o o c.^' 

Now if we apply this to any set of linear simultaneous equations one of 
two possibilities will occur. The first is that the equations are not 
uniquely solvable, but have either no solution or an infinite number. In 
that case we know that at some point in the elimination all the remaining 
equations will reduce to zero on the left-hand sides. This means that 
we have reduced d to a determinant in which one or more rows consist 
entirely of zeros, i.e. d = o. We thus have the result that zohen a system 
of n linear equations in n unknowns is not uniquely solvable^ the determinant 
d of the coefficients ?nust be zero. 

If however the system has a unique solution we can carry through 
the elimination and reduce d to the form shown in equation (17.31)- 
Here a^y bf and cf' are the first, second, and third pivotal coefficients 
respectively. Now since every term in this determinant must be the 
product of three elements no two of which lie in the same row or the 
same column, we see that there is only one non-zero term, and that is 
the product of the elements af)fcf' in the principal diagonal. This 
shows that if the equations are uniquely soluble, the determinant d of the 
coefficients is equal to the product ajy^cf' . . . of the pivotal coefficients, 
and is therefore not zero. 

Thus in the Scheme i elimination of Section 17.4, the determinant 
A will be 5 X 5-0 X —7-68 ^ —192. 

p^otc —The theorem that d = axbfcf *..., the product of the pivotal 
coefficients, will be true if all the pivotal coefficients lie on the principal 
diagonal, which will almost always be the case in practice. But for the 
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sake of accuracy we give here the (rarely needed) correction when they 
do not all lie on the diagonal. In that case the determinant d and the 
product of pivotal coefficients may differ in sign, as the arrangement in 
the final determinant (17.31) may be different. However, this is easily 
allowed for, as by suitable interchanges of columns the pivot can be 
brought onto the principal diagonal. Thus in Scheme 2 of Section 17.4 
we begin by eliminating z in the first equation: the pivot then lies two 
places to the left of the principal diagonal, when we write the unknowns 
in the order x, y, z. It could be brought on to the diagonal by writing 
the z term first in all our equations, and that could be achieved by two 
interchanges of columns: firstly an interchange of the z column with the 
y column, and then the z column with the x column. (We do not need 
to do this, only to imagine it done.) In the second elimination the pivot is 
one step horizontally from the diagonal, and in the third elimination it 
is on the diagonal. (N.B.—In finding the principal diagonal at each 
stage we ignore any gaps in the arrangement due to columns where the 
variable has already been eliminated; such gaps are imagined as closed 
up.) Thus in Scheme 2 the pivots are altogether 2 + 1 4-0 = 3 hori¬ 
zontal steps from the principal diagonal; and since this is odd the correct 

formula to use is d = —product of pivots = —i x (— i) x (_192) 

= —192. The same remark applies to Scheme 3. 


In Scheme 4 to find d we must divide the product of the pivots by 

the extra factors introduced for convenience of solution: d = 3 x 5 X 

116/(3 X 3 ) = 464- "I’hus our methods of solving equations also find 
the determinants of coefficients automatically without appreciable addi¬ 
tional labour. (N.B.—This also provides a general method of finding 


the value of a determinant 


o, A, r, 

^2 ^2 Co 

^3 ^3 C3 


of known numbers even when 


it is not specifically associated with any system of equations. We write 
down the three expressions 


ayX b,y 4 - c^z, a^x 4- b.^y 4 - c^z, 4- + c^^z. 

Then using (say) n,x as a “pivot”, we eliminate a' from the second and 
third expressions by subtracting suitable multiples of the first expression 
a^x 4- b^y 4- c^z from them, exactly as in the process of solving simul¬ 
taneous equations. 'I'he two last expressions will then become 

^2y ^ new pivot is now selected (say A /v if 

6 ^' is not y.QTo) and used to eliminate j/ from the last expression by sub^ 
traet.ng a suitable multiple of b^y + This last expression will 

then be reduced^to, say, e/a. The required determinant is the product 


a^b^Cz” of the pivotal coefficients.) 


Now when A o the equations (17.30) have a unique solution, and 
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this solution can be readily found by the use of determinants. For by 
the rule which permits us to add multiples of one column on another, 


a^x + Aiy +CjZ 

Ai Cl 

1 a^x 

hi Cl 

+ hoy -\- C2Z 

II 

QoX 

As C2 

1 + A3>' -\-C3Z 

^3 ^3 1 

a^x 

A 3 C3 


(subtract y times the second column and z times the third column from 
the first). That is, by (17.30), 

/ii 61 Cl I a, *1 Cl 

h 2 h 2 c 2 1 2 ^2 ^2 ^ 

I 3 63 C 3 ; <23 A 3 C 3 1 

i.e. 

//i lA Cl j j \ ^\ j 

X — Aj A3 Co I / I ^2 A2 - . • (^ 7 - 3 ^) 

A 3 A 3 C 3 j / j «3 A 3 C 3 

The denominator is the determinant the numerator is a similar 
determinant, but with the a column replaced by A’s. There will be 
similar expressions for y and Z'. 

I : Ai Cl i J : Ai Ai 

y ~ 2j ; ^2 Ao C2 j z = rto A2 A2 . (t 7 - 33 ) 

|^ 3 A 3 C 3 , ^3 ^^3 

Note —This gives us the solution of the equations in an explicit 
form. But for the actual calculation of numerical values the methods 
developed in Section 17.4 are to be preferred. 

17.17 Homogeneous equations 

Our discussion leads to an important conclusion concerning “homo¬ 
geneous” equations, i.e. equations for which the right-hand sides Ai, 
A2, A3 are all zero. Such equations have the form 

^1-^' AiV + c^z — o Ai Cl 

«2A' -f A.,y -j- C2« = o d — ; rto As C2 • (i 7 - 34 ) 

rt 3.V -r A3)’ -f c3^ = o a 3 A3 c3 

There is therefore always at least one solution, namely .v = o,y =* o, 
= o. U A ^ o that must be the only solution, whereas if J = o there 
must be an infinite number of solutions. So the necessary and sufficient 
condition for equations (17.32) to have a solution other than .r = o, 
y — o, z --- o is that A = o. 

17.18 Minors and co-factors 

The following definitions are occasionally useful and are given for the 
sake of completeness. 
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fll 6, Cj 

Consider any determinant J, say . The mmor of any 

o 3 ^3 3 

element is defined as the smaller determinant obtained by striking out 
the row and column containing the element in question. Thus the 

minor of a, is we shall denote this by A^'. 

3 3 

The minor of is A2 = A * , with the a column and second 

03 C3 

row removed. 

The co-factor of an element is the determinant obtained from A by 
replacing that element by i, and all other elements of the same column 

I b^ Cj 

by o. Thus the co-factor of a, is o b^ . Now when we e.\- 

I O 63 C3 i 

pand this co-factor we shall have terms consisting of the product of 
three elements, one chosen from each column, and no two in the same 
row. But the only way to obtain a non-zero term in is to choose the 
I out of column i; the elements in the other two columns must be 
chosen from the second and third rows. But this means that, apart 
from the question of sign, the terms in the co-factor Ai are exactly the 
same as the terms in the minor A^\ This is a general argument, apply¬ 
ing to any co-factor and the corresponding minor. Consideration of the 

rule for signs shows (without too much trouble) that the precise relation 
is 

co-factor of any element = ( — 1)' x minor 

where s is the number of (horizontal or vertical) steps between the ele¬ 
ment and the principal diagonal. Thus for elements on the diagonal the 
co-factors and minors agree. For example, for the third-order deter- 
m.nant A = A,' = - b,c,. B, = B= a,c, - a,c,. and 

U3 = C3 = Oibz — hor elements one place off the diagonal the 

minors and co-factors differ in sign, e.g. A^ = —A^ = —(^iC-,—6 c )• 
for elements two steps away they agree again: C, = C,' = albl -a j ' 

1 his rule gives us a method of calculating co-factors. (It also follows 
by transposition that we could equally well have defined the co-factor 
of an element as the determinant obtained by making that element i 
and all other elements of the same row zero.) ’ 

PROBLEM 

(i) Find all the minors, and thence all the co-factors, for the deter¬ 
minants 

* 2 3 531 

214 583 

■710, 32-7. 
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Now the followng relation follows from the addition and multiplication 
rules for determinants 

flj h\ Cj ci\ Cl 0 ij C]_ o bi Ci 

^ On b^ c2 — o b^ cn “h 2 b2 ^2 ® b^ c2 

^3 b2 Cq o 63 C3 o 63 c"3 O2 bj 

= OiAi -h 02-^2 ^^3-^3 - • • • ( 17 * 35 ) 

This is called the “ expansion of the determinant A in terms of its first 
column”. Similarly in terms of the second column we shall have 
J = + bnBz + b^B^t in terms of the third column A = CiCi 

+ C2P2 + ^*303. We can also expand by a row instead of a column: 

A = aiAi ~ hiBi + CjCj, etc. 

It also follows from this argument that if we replace aj, ag, in the first 
column by Z;,, respectively, the new determinant 

hi bi Cl 

hn bn Ca =+ ^'3^3 - • (17*36) 

h 2 b 2 e 3 

Thus the solution of the equations OiX + biy -f- CiZ = /f„ + b^y + 

CnZ = hny fl3.v -\- b^y + c^z = /ig, which we gave in determinant form 
in equations (17.32) and (17.33), equally well be written 

.r = {hiAi ^-h^An + /* 3 ^ 3 )/*d 

y = (hiBi + /72B2 + h^^^lA 
z = {JiiCi -\-h2C2h^C^jA , . ( 17 * 37 ) 

FURTHER PROBLEMS 

(2) Use the results of problem (i) (above) and equations (17.37) 
find the solution of equations (17.14). 

(3) Show that for any (third-order) determinant biAi -^-bnAn + 
bwA^ = o; i.e. the products of the elements of any one column of a deter- 
minant by the co-factors of the corresponding elements of a different column 
sum to zero. 

(4) Show that OiA^ -\-biBy + C1C3 — o, and state the general 
result. 

17.19 General simultaneous equations 

In Section 17.2 we described Newton’s method of successive 
approximation, which can be applied to solve any equation in one un¬ 
known. W'e take any approximation .Vi to the root of the equation, and 
then by applying (17.9) we obtain a second and improved approxi¬ 
mation a* 2. The procedure is repeated with a* 2, obtaining .V3, and so on 
until the correction becomes negligible. 
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The same can be done with equations in any number of unknowTis. 
y, z) = 0, ^(.v, y, z) = o, h{x, y, z) = o be any three simultan¬ 
eous equations in the three unknowns x, jy, Let fx{x, y, z) be the 
partial derivative of /(.v, y^ z) with respect to .v (keeping y and z fixed): 
fv(Xy y, z), f^{x, y, z) etc. are other partial derivatives. Now let us take 
provisional values x^,yj, jetj of x, y, z, and calculate the values of /, g, 
and h and their partial derivatives at (x„ y^, jaJ. We then have for any 
values (x, z) which do not differ too greatly from (.v,, jj, z^) first- 
order approximations 


f(x,y, z) ~ f{x„y„ z,) + {x - x,)Mx,,y„ z,) + (y-y,)Mx„y„ z,) 

+ (c — ir,)Vi, tr.) 

^ [/], + + yf/j, + z[/j, 

\vhere [/]i denotes/(x,, j*,, Zi), i.e. the value of /(x,y, z) at the provi¬ 
sional point (xj, yi, Zj), [/J, similarly denotes /r(x,, jy,, z^) and 
X = X — Xi, Y = y — y^, Z = z — z^. 

In a similar notation 


S(x,y,z)=[g], +X[g;\, + Ylg,], -h 2 [g>]i. 

Now we wish to find values of x, y, z for which/(x, y, z) = g{x, y, z) 
~ h{Xy y, z) = o. This means that to our order of approximation we 
nave to solve the three linear equations 


[/J 

.A' + l 

[/v]i y + 

[A] 

g^. 

i-V + 

[^v 1 y + 

g: 

[AJ 

lA' + 

[/'j, i' + [/<j 


-[gh 

z = ~[h]. 


• (17-38) 


and then t^e Xg — Xj + X, jv'g — yi Y, Z2 = Zi Z as the next 
approximations to the root. It is however rather a nuisance to have 
to solve a set of simultaneous equations at each step: and the following 
modification avoids that. We find the second approximation (x.., jg. -•>) 
from the first, by solving the equations (17.38). If this differs’greatly 
from (xj, y^, z^) we repeat the process, calculating a new set of equations 
and solving them. After a few steps successive approximations will not 
differ very greatly, and the coefficients [/J„ [/J,, etc. on the left-hand 
side of (17.38) will not change very much from stage r to stage (r + i) 

We therefore take them to be fixed at their values in stage r and solve 
the equations 


[/x 

[Vx 

/lx* 


r^ + [A]ry+[/.].Z = 


u 


rX +[gv]rY -\-[g^]rZ =V 
r X Y + [ft^r Z — W 

of ». v and w. following Scheme 3 of Section 17,,. 
1 his will give equations of the form ^ ^ 



+ P2V 
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where Pj, P2, etc. are numbers which we determine by this process of 
solution. The rule for successive approximation is now as follows. Let 
{Xr, yr, ^r) bc thc rth approximation; and let [/]r, k]r. Mr denote the 
calculated values of f(xr, 3V. ^(^r. ^r) and h{Xy, yr, Zf) respec¬ 

tively. Then the (r -h i)th approximation is given by 

X.,, ^ X, - P,[/], - PMr - Pm, 

Jr +1 =3V — Qlif r — QAg\r ~ PsMr 

~r + l “ ^l[/. r ^2[^]r PaMr- 

The process is repeated to determine (Xr+2. >V+2> ^f+2) 

(^r+i» >'r+i» “S^r+i) keeping the same coefficients P„ Pg, P3 ■ • • 
throughout. This is a self-correcting process, so that any small error 
made in the calculation will not affect the final answer. 

EXAMPLE 

(i) We shall use this method to solve the equations /(.v, y) ~ 
X -j- y — 3 = 0 and ^(.v, y) — .yy — 1=0. We start with the trial 
values .Vj — 2, = o. 


Stage 

I 

2 

3 

! 4 

5 


^4 # % # ♦ 

2 

2-5 

2625 

; 2-618056 

2-618034 

yr 


O 

•5 

•375 

; -381944 

! -381966 

If ] 

r = -Vr + Vr - 3 

— I 

0 

0 

i 0 

0 

Mr 

= -Mv-i 

-I 1 

•25 

— •015625 

—•000049 

•000000 

[fr 

^ — I • • . - j 

I 

I 

I 

1 


[a: 

^ “' ■ I * « • • 1 

I 

I 

I 

1 

■ 

1 


r = >V • . 

0 

•5 1 

•375 



gv . 

ss; V 

J " ^ V ^ » • • 

2 

1 

2-5 

2-625 



X , 

• • • • 

•5 

■125 

— •006944 

1 

—•000022 


Vr 

• • • » 

1 

•5 ■ 

1 

1 

—125^ 

•006944 

1 

•000022 

1 

1 

1 



Here A'^, Vr are the correcting terms to be added on to x, and Vr 
respectively to give Xr^\ and Vr-. u ffi® tiext approximations. In the first 
two stages we get X, and Y, by solving the equations 

[f.]rX, + [A]. V. = -[/], 

[gs]rX,-^^[g,],y,^- -[g], 

I lowever by the time we get to the third stage the correcting terms have 
become small: so it is convenient to take the values of [/J, [^J, [/J 
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[^v] fixed from that point onwards at their values in the third stage, 
and to solve the general equations 

•37s + 2 625 yr = V 

These equations have the solutions 

Xr = — I'66667« -f -444441) 

Yr = •66667M — •444441) 

So from that point onwards we take the correcting terms Xr and Yr to 
be 

Xr = —1-66667 [f\r + ‘44444 [^]r 
Yr = -66667 [f\r — -44444 [g]r 

thus shortening the calculations considerably. By stage 5 the differ¬ 
ences between the calculated values of f{x, y) and g{x, j) and the re¬ 
quired values o have become neg.igible: so we can assert that x = 
2-618034, jv = -381966 correct to 6 places. 
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i8.i Definition of a matrix 

A matrix is simply a set of numbers set out in a rectangular array. 
We have met several examples already, for example the array of the set 
of coefficients in a system of simultaneous equations. Any rectangular 
array of numbers can be considered in principle as a matrix, for example 
a two-way classification. Thus suppose that a table has been prepared 
classifying all the students in a college by sex and country of origin, as 


follows: 


English 

Welsh 

Scottish 

Irish 


Men 

313 

37 

62 

IS 


Women .. 

65 

5 

17 

10 


'This is an array of two rows by four columns, or a 2 X 4 matrix. 

'I'licre are certain conventions of notation which are generally 
observed. It is convenient to have a single symbol to represent the 
whole arrav of numbers: as a rule either a capital letter such as ^4 or a 
heavy tvpe letter such as a is used. The individual numbers in the 
matrix are called “elements” and are usually denoted by subscripts, 
thus: 

r ^11 ^12 ^13 ^11 ”1 

'The symbol ‘Vr/’ means “the element in row r and-column s of the 
matrix a". (Note that the first subscript akcays indicates the row, and 
never the column.) The whole matrix a is indicated by enclosing the 
arrav in round or square brackets. If it has iit rows and n columns it 
is called an m n matrix: if in = 11 it is a square matrix of order n. 

Notice carefully the distinction between a determinant and a matrix. 
\ determinant is written as a square array of numbers, but it stands for 
a single number calculated from the array. This is indicated by placing 
a vertical stroke on each side of the array. A matrix stands for an array 
of separate individual numbers, and not one of them can be altered 
without making it into a different matrix: the matrix equation a — b 
means that everv element of the array a is equal to the corresponding 
clement of the array b, i.e. — br, for all values of r and s. This is 
indicated by enclosing the array in round or square brackets. The dis¬ 
tinction is somewhat like that between the “population of Britain”, 
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meaning a single number, say 50,387,642; and the “people of Britain”, 
meaning 50,387,642 separate and distinctive individuals. Furthermore 
a matrix need not be square. It is evident that from any square matrix a 
we can calculate a determinant; this is usually denoted by det a or \a\. 
But this determinant is not the same as the matrix. 

It is also convenient to call the matrix of m rows and n columns, all 
of whose elements are zero, the “ zero matrix ” or if there is no 
danger of confusion, simply as O. 


18.2 Transposition of a matrix 

If we rewrite our classification of College students with the sex 
running horizontally and the nationalities vertically we obtain a new 
matrix, in which the columns of the original one have been turned into 


English 

Welsh 

Scottish 

Irish 


Men 

Women 

3*3 

65 

37 

5 

62 

17 

15 

10 


rows, and the rows into columns. This is called the “transpose” of the 
original matrix: the transpose of a is usually written a\ or sometimes 
as aT It is clear that the transpose of an tn x n matrix is an n x m 
matrix: in terms of the elements the operation of transposition can be 
written 


^ rs • . . (18.l) 

(c.g. the element <2,3 of the old matrix becomes the element of the 
new one.) It is also clear that to transpose the transpose returns us to 
the original matrix 

(«')' = a ... (18.2) 

The theorem that the transposition of a determinant does not affect its 
value can be written 


det a = det a 


(i8-3) 


18.3 Addition of matrices 

The University of Edinford contains three colleges, St Patrick’s 
Narkover, and Queen Anne’s. The student populations of these three 
colleges and the total university are given by the following four matrices, 
which we call a, A, c and r respectively: 




r Men 
\ Women 

English 

Welsh 

Scottish 

Irish 

(0) 

St Patrick’s 

313 

65 

37 

5 

62 

'7 

15 

10 

(b) 

Narkover 

r Men 
\ Women 

25 

7 

7 

2 

IS 

13 

4 

c> 

(c) 

Queen Anne's 

/Men 
\ Women 

13* 

40 

6 

2 

*5 

31 

91 

8 

(t) 

University 

r Men 
\ Women 

470 

112 

50 

102 

6t 

110 

27 
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It is then natural to say that the matrices a, b, and c sum to the 
total and to write 

a + A + c = r 

In this “matrix addition” the corresponding elements are added, i.e. 
^11 = dll + *11 + <^11 = 3^3 + 25 + 132 = 470, and in general = 
art + *r4 + ^rs' is clcar from this definition that the addition can 
be performed in any order; a+A+c — c+A+fl = c+ c+A. 

“Matrix subtraction” is similarly defined by the subtraction of the 
corresponding elements: if a — b — d then art — *r« — From 
these definitions it follows that 

a — a~ O . (18.4) 

(For art — = o) 

a = . . (18.5) 

(For a,., -h o = a„) 

a — A + A = a . - - (i8.6) 

(For art — brs + A„ = flrs- This means that subtraction is the 
reverse of addition.) In short, matrices behave like ordinary algebraic 
quantities as far as addition and subtraction are concerned, when we use 
these definitions. There is also the extra rule for transposition: 

(a + A)' == a' 4 - A' . . . (18.7) 

Notice that the rule for addition of matrices is quite different from 
the rather artificial rule for the addition of determinants. If a + A = c 
it does not follow that det a + det A = det c. Notice also that we can 
only add matrices that have the same shape, i.e. the same number of 
rows and columns. 

18.4 Multiplication by a number 

We shall say that a matrix a is multiplied by a number k if every 
element of a is multiplied by k : 


r ^11 ^12 ^13 

1 = r 

kai2 

Aai 3 "1 

L ^21 ^22 ^23 _ 

] L ^^^21 

kci 0 2 

*<^23 J 


Any matrix can therefore be multiplied by any number. In suffix 
notation the definition runs: 

if ^ = ka, then A„ == (for all r and r). 

This product has the following properties, which are readily proved 
from the definition: 
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a -h a = 2a 
Aa + Afl = (A + /t)a 
ha — ka — {h — k)a 
h{ka) = {hk)a 
k{a + b) = ka kb 
{ka)' = ka 
la = a 

oa = 0 . . . . (18.8) 

In brief it behaves according to the usual rules. It is customary to 

denote the matrix ( — i) a by — a: this corresponds to chaneinc the sif?n 
of every element. 

- [-: a - [■: i] 

18.5 Symmetric and antisymmetric matrices 

A square matrix a is said to be “symmetric” if it is equal to its trans¬ 
pose a', i.e. if In other words, it is not altered by reflection 

in the principal diagonal, e.g. Q is a symmetric matrix. 

An “antisymmetric” or “skewsymmctric” matrix is one for which 
i.e. ars~ —a>T% as for example ^ ° Since it fol¬ 
lows from this definition that we have a,, = o for all r, 

I.e. all the elements in the principal diagonal must be zero. 

Since (a + b)' = a' + b' the sum of two symmetric matrices is sym¬ 
metric. and the sum of two antisymmetric matrices is antisymmetric 
Also the product of a symmetric or antisymmetric matrix by a number 
IS respectively symmetric or antisymmetric. 


18.6 Multiplication of matrices 

One might expect, by analogy with addition, that two matrices would 
be multiphed by multiplying corresponding elements. But that is not 
the definition used in practice, for the reason that there is another 
dehnition which proves to be much more useful. 

Consider the following situation. In the University of Camburgh 
there are three biological faculties, namely Botany, Zoology, and Medi¬ 
cine, and the lecturers in these faculties are distributed as follows- 


Botany 

Zoology 

Medicine 


I 

5 

7 






4 
2 

5 


3 

I 

4 


- t 

Wc shall call this matrix a. Now a questionnaire showed that every 
mghsh lecturer had two sons and one daughter, every Scottish lecturer 
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had one son and two daughters, and every Irishman three sons and three 
daughters. This can be represented by a second matrix b. 



Sons 

Daughters 

English 

2 

X 

Scottish 

I 

2 

Irish 

3 

3 


From these matrices we can find the number of sons and daughters 
in each faculty. For example in Botany there are one Englishman with 
two sons, four Scots with one son each, and three Irishmen with three 
sons each, making a total ofiX2+4Xi+3X3 = i5 sons. 

In this way we get the following matrix p: 



Sons 

Daughters 

Botany 

15 

18 

Zoology 

15 

12 

Medicine 

3 » 

29 


This matrix p is called the “product” ab of the matrices a and by 
and is written 



An element of p, say/>r.5 in the rth row and sth column, is obtained by 
multiplying each element in the rth row of a by the corresponding ele¬ 
ment in the ^th column of b and adding. Thus the element p\2— 
obtained from the first row of a and the second column of b\ 


= iX I + 4X2-1-3X3 = 18 

or in suffix notation 

^11^12 "h ^12^22 d” ^13^32 fZ 10^02 

a 

and in general 

pTS = H- «r 2 ^ 2 * + ^ • (18.9) 

a 

'I'hus it is only possible to multiply two matrices when the number of 
columns in the first is equal to the number of rows in the second: and 
in general the product of an (J X ni) matrix a and an (m X «) matrix b 
is an (/ X w) matrix ab. 

Note .—When in practice we have to multiply two matrices we shall 
follow the rule given above. But as it is rather difficult to run by eye 
across the row of one matrix down the column of the second at the same 
time, it is better to modify the arrangement slightly (Fig. i8.i). 
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Total 

^ 4 3 fiS 33 

a = 521 aA = 15 12 27 

7 5 4 L31 29 J 60 


b’ = 213 

I 2 3 

(336 Total) 

Fig. 18.1—Practical multiplication of matrices 

The matrix b is written in transposed form b' underneath the matrix a. 
The elements in the first row of ab are then obtained by multiplying and 
adding the first row of a by the first and second rows of b' in turn, i.e.: 

first row of a X first row of b' 

1x2+4x1+3x3=15 

first row of a X second row of b' 
1X1+4X2+3x3 = 18. 

As a check on the arithmetic we can add a “total” row under b\ 
which is simply the sum of all the rows of b'. This gives on multipli¬ 
cation by any row of a a “total” for that row of the product ab\ and if 
there are no errors of computation this should be equal to the sum of 
the elements of that row in ab. Thus for the first row the total is 
1X3+4x3+3x6 = 33=15 + 18 (check). The second and 
third rows are similarly calculated and checked. 


Now suppose that we know further that the assistant lecturers are 
distributed ^ong faculties and nationalities as follows: 


English Scottish Irish 

Botany .. 

Zoology 
Medicine 



We shall call this matrix A. Then a + A represents the combined distri¬ 
bution of lecturers and assistant lecturers, or, say, of “junior staff”. 
Now if it is true of assistant lecturers as of lecturers that every English¬ 
man has two sons and one daughter, every Scot one son and two daugh¬ 
ters, and every Irishman three sons and three daughters, then the distri¬ 
bution of children of assistant lecturers among faculties would be given 
by the matrix Ab, i.e. ^ 


Botany . . 

Zoology 

Medicine 


Sons 

Daughters 

5 

7 

7 

8 

12 

12 


• # 


# $ 
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Now we can calculate the distribution of children for the combined 
junior staff in two ways. We can add the separate distributions for the 
children of lecturers and assistant lecturers; that is, we add the matrices 
ab and Ab. Alternatively we can take the matrix (a + A) representing 
the distribution of combined staff among nationalities, and multiply it 
by by which represents the number of children for each nationality 
separately, obtaining (a + A)b. The final results must be identical, i.e. 

(a + A)b = ab Ab . . . (18.10) 

This relation can also be proved by use of suffix notation. If m = 
(a 4- A)by then A^)bas = ^ drahas + ^ A^abas', but 

^ Oraba, is the element of ab in row r and column j, and 2* Araba$ the 
corresponding element of Aby i.e. = {ab)^, + (Ab)rs in a ^irly 
obvious notation. So w = aA + Ab. 

We can show in a similar way that, for any B, a(b B) = ab aB. 

Now suppose further that every son of a member of the staff pos¬ 
sesses one pet rabbit and one kitten, while every daughter possesses two 
kittens, but no rabbit. We can represent this by a further matrix c: 

Rabbits Kittens 

Son .. .. I I 

Daughter .. o 2 

Then the matrix product q = be gives the total number of pets in a 
family according to the nationality of the father: e.g. an English lecturer 
has two sons and one daughter, and therefore 2 X i-hiXo=2 
rabbits and 2Xi+iX2 = 4 kittens. The matrix aq = a(bc) will 
therefore show the number of pet animals in each faculty. But this 
distribution can also be obtained by multiplying the matrices p ^ ab 
connecting faculty and number of children, and c representing the 
number of pets of each child. Thus 

a{bc) — {ab)c . . . (18.11) 

PROBLEM 

(i) Verify this identity by actual multiplication of the matrices a, 
by and c. 


Equation (18.11) can also be established by the use of suffixes. Let t 
denote aq = a{bc) and u denote pc — (ab)c. Then 

tfs ~ 2 a fa (Jas = 2 a^a (2 ba^ ^^ 5 ) 

a a /9 

^ 2 {^a^a ba^ ^ 5 *) 

0,3 
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«ra = ^Pr^ = T {£0^0 I'a^) C^, = Z (f 7 ^ baS CR,) = tr, 

P P a.^ 

i.e. u ~ t, {ab) c = a {be) 


So far matrices have obeyed exactly the same rules of operation as 
ordinary algebraic symbols. There are other analogies. For example, 
let /„ be the square matrix of n rows and w columns whose elements are 


I in the principal diagonal, and o elsewhere, e.g. /j = [i], /^ = I * ® 

L® ^ 

etc. Then if a is an m x n matrix, al„ = I„,a = a. (E.g. if every son 
has one rabbit and no kittens, and every daughter has no rabbits and 
one kitten, then the distribution of sons and daughters is necessarily 
identical with that of rabbits and kittens respectively.) The matrices /„ 
therefore beh.ave like the number i as regards multiplication, and are 
called “unit matrices”. It is usual to write them as /, leaving the suffix 
n to be understood. This leads to no ambiguity, since in a product al 
the number of rows of /is fixed as being equal to the number of columns 
of a by the rule of multiplication. The elements of /are usually denoted 
by the symbol 8,., (the “Kronecker delta”) rather than thus 8 ,., = o 
if r ^ s, and 8^,. = i (by definition of /). 

There is one important respect in which matrix multiplication diflfers 
from that of ordinary numbers. The products ab and ba are not neces¬ 
sarily equal: multiplication is not “commutative”. For example: 




(and indeed the product ba will not exist at all unless b has the same 
number of columns as a has rows). Thus in multiplying (a + b){a~b) 
we must remember to keep all products in correct order as follows: 
(a + *) (a — 6) = a{a ~ b) + b{a — b) = aa — ab + ba ~ bb\ this is 
not usually equal to aa — bb. It is usual however to write aa as 
aaa as a®, and so on: there is no question of order of factors involved.* 
In the same way {a + by = {a + b) {a b) = a- + ab + ba b^ 
and this cannot be simplified. 

If we transpose the matrices a and A, what happens to their product? 
In the product ab we multiply the rows of a into the columns of b. But 
the rows of a become the columns of a\ and the columns of b become 
the rows of A': so the elements of ab become elements of the product 
ba. In particular the row r-column s element of ba is obtained by 
multiplying row r of A' by column f of a and summing, that is, by 
multiplying column r of A by row s of a, and so is equal to the row 
column r element of ab. So b'a' is the transpose of ab. 



Similarly {abc)’ — cb'a. 


(18.12) 
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FURTHER PROBLEMS 

(2) Expand (a + How many terms are there in this expression? 

(3) Find {a + b)(a — b) — {a — b) {a + b) 

(4) Find [a + A) (a — b) {a b) — {a — b) {a + 6) (a — b). 

Note. —The reader may consider the definition of multiplication of 
matrices given above a little artificial; it is absurd, he will say, to sup¬ 
pose that all Englishmen have exactly one son and no daughters; the 
number will vary from family to family. But if we took the matrix b to 
represent the average numbers of sons and daughters—or the average 
numbers of children with red and brown hair—or any similar characters, 
then the matrix ab will give us the numbers of sons and daughters (or 
red- and brown-haired children, etc.) we would expect to find on the 
average, given the matrix a of observed numbers of nationalities and 
faculties: and this sort of argument may be quite useful. 

18.7 Vectors interpreted as matrices 

Suppose we have a vector OF = v in space. Let us take three co¬ 
ordinate axes X, y, and z in space with O as origin. Draw PQ perpen¬ 
dicularly from P onto the plane containing the x and y axes, and QR 
perpendicularly from Q onto the .v-axis (Fig. 18.2). Then the vector v 



Fig. 18.2—The three components of a vector in space 

is the sum of the three vectors OR, RQ, and QP. Now the distance OR, 
taken with the proper sign, is by definition the component Vi (say) of v 
in the direction of v; RO is the component 1*2 in the ;y-direction, and 
OP is the component 1*3 in the s^-direction. If we construct three vec¬ 
tors Ml, M2, M3 of unit length pointing along the x, y, and z axes respec¬ 
tively, then vcctorially OR ~ t'lMj, RQ = ~ 

V = OP = OR -f- RQ 4- QP vectorially, i.e. 

V = z'lUi v^Us 4 - t-'gMg . • (18.13) 
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Thus the vector v is completely determined by its three components 
Viy V2> and (which are three ordinary numbers). If we write these 
three numbers in a row we obtain a i X 3 matrix [t?i, Vo, t’a]- this is 
the matrix corresponding to the vector v. Each vector has such a 
matrix, and each such matrix determines the vector uniquely. 

Now when we add vectors, we add their components (Section 14 5); 
if w is another vector with components [a?,, W2. then v -r w has 
components [Vi + ^0 + W2, 4- tt’a]- But by the definition of the 

addition of matrices this is the sum [t’,, ^2. ^‘•’2* ^3]- So the 

addition of vectors corresponds to the addition of their matrices. 
Similarly we know that the product kv of the number k and the vector v 
has components [kv-^, kv^, and this matrix is simply the matrix 

V2, of V multiplied by k. Also the zero vector O has zero com¬ 
ponents, and so corresponds to the zero matrix [o, o, o] = O. 'I'hus 
every operation on the vectors v is exactly matched by the corresponding 
operation on the 1x3 matrix [r,, ^2, of components. So close is 
this analogy that it is customary to call the matrix [t j, ^’3] ^ “vector”. 
If it is desired to make a distinction between the directed magnitude v 
and the matrix [t’j, ^^3] 've could call the first a “space-vector” and 

the second a “row-vector”: but the correspondence between the two is 
so close that usually the same word “vector” will serve equally well for 
either. In fact we go further, and call any matrix consisting of a single 
row of numbers a “vector”, however manv numbers there are in the 
row. Similarly any matrix consisting of a single column only is called a 
“column vector”. If a row or column vector contains three numbers 
only, v^y and v^y it can be represented geometrically by the vector v 
in space which has the components z»,, v^y and ^3. This vector v forms 
a picture of the matrix. If the vector contains only two numbers, 
and ^>2» it can be represented by a vector in a plane. Unfortunately it 
it has more than three components we can form no such picture. 

A row vector can easily be written down as [t-i, ^ t’3, . . A 


column vector can be written in column form 



but if it contains 


many elements this form occupies a great deal of space. I'hu.s for typo¬ 
graphical reasons it is simpler to write this as v.,]'y using the notation 
for transposition; and any column vector can be written as the trans¬ 
pose of a row vector, e.g. [i, 4, 2]' stands for the column vector or 3 X i 
matrix with elements i, 4, 2 in order reading downwards. (A. C. Aitken 
uses the notation { 1, 4, 2}). So to economize space we shall in future 
write every column vector in this form. 


18.8 Complex numbers interpreted as matrices 


Consider the 2X2 matrices I — 
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Then direct multiplication shows that P — II ^ /, Li = /./ = i, and 
i- = — I. Here I is the unit matrix, which resembles the number i in 
that Ix — xl = X for any (2 X 2) matrix x. i is therefore a sort of 
matrix square root of minus i. The complex number z = x iy can 


be represented by the matrix z = ;e/ + j'l = ^J L V oJ ~ 

^ and this matrix will behave with respect to addition and 

—y 


multiplication exactly like the complex number z. For instance 

(.vj + yj) + (X2I + yzi) = xJ + X2I -\-yji -hyz* = (•»! + *2)/ + 

(>*1 +^2)*! whereas we know that the complex numbers (xj -f- iVi) and 
(^2 + O' 2) add according to the rule (.V, + zy,) -f (:v 2 + ty 2) = (^1 + ^2) 
+ i'CVi + yz)- Similarly {xj +3'ii) {^Vo/ +^20 = + Xiyzli + 

yiXzil + >’1^2*" = (^iA’2 — 4 - (^1^2 + yiXz)*, while the product 

of {Xi + z>i) and (x^ + *>2) (.Vi-Va —yiyz) + (*i>’2 +>’i^2)«* Thus 
these matrices give us an alternative interpretation of complex numbers. 


18.9 Simultaneous equations in matrix form 

Suppose we have two simultaneous linear equations in two unknowns 
X and y: 

a^x H- b^y = 

+ b^y = /Z2 


Then the rule for multiplication of matrices shows that these can be 
equally well written as 


[:: [;]=CO 

or if we denote the matrix of coefficients ^ M by a, the column 

L«2 ^l’J 


vector [x, y]' by x\ and the column vector [/zj, lu]' by h\ this becomes 
ax' — h\ a very concise form. This notation extends to equations con¬ 
taining any number of unknowns. 


18.10 Division of matrices 

We know that the equations ax' = h' are uniquely solvable if and 
only if the determinant of a is dilferent from zero. We can extend this 
result. Let a be any given square m x m matrix, and b any m X n 
matrix. Then if the determinant of a is different from zero there is one 
and only one matrix x such that fl.r = b: if det a = o this is not so. 
(By the rule for multiplication x must be an m x n matrix.) For con¬ 
sider the 5th column of x, consisting of the elements .Vu, Jfg* • • • 
.Y„,s in order, and the 5 th column of 6, consisting of the elements bu, 
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^2a> • • • The equation ax = b implies that these are related by the 

m equations 

“t“<^12A'2s + . . . ^ ms 

^21 -■'■’l a H- <2 2 2 •*’23 + - • • + «2m -'‘ms — ^aa 

^ml ^Is ^m2 -^23 “I" • • • ~i“ ^mm -^ms ~ ^ms 


and these are uniquely solvable for the unknowns a*,,, .Vj, . . ..v„, if and 
only if the determinant of a is not zero. 

In particular it follows that if det a ^ o we can find just one matrix x 
such that ax = I. This matrix is called the “reciprocal” or “inverse” 
of a and denoted by ; so 

aa-^ = l . . . (18.14) 


A matrix a whose determinant is non-zero is often called “ non¬ 
singular”, so this definition shows that a non-singular matrix has a 
unique reciprocal. We shall see later that a singular matrix, with zero 
determinant, cannot have a reciprocal. 

Now if det a ^ o the equation ay = a must have a unique solution 
y. But one possible value of y is /, and another is a~^ a, since a{a^^ a) 
= (a a-^)a — la = a. It follows that, since the solution is unique, 


a“* a = I = a a~^ 


(18.15) 


We can now give the general solution of the equation ax = b. 
Multiply this equation on the left-hand side by a*"*; we obtain a-^ ax 
= a~^ b. But ax = Ix = x\ so the solution is x = a-^ b. Simi¬ 
larly the solution of the equation Xa = b X = ba~^. 7 'hcse two 
expressions a~^ b and b a~^ can be considered as the analogues of the 
ordinary quotient bja', but with matrices there are two distinct quotients 
because the order in which multiplication is performed is now important. 

In particular the solution of the set of simultaneous equations 

ax' — h' is .r' — a“* h'. 

Note also that 


(A-* a ^) {a b) = («'* o) A = lb =» A-‘ b -- I 

so that A”’ is reciprocal to ab\ 

(oA)-» = A-> 0-1 . . (18.16) 

Similarly (aAc)-i = c"* A-* a-i, provided that a, A, and c arc all non- 
singular. 


PROBLEMS 

(1) Prove that (a')-* = (<*~‘)^ 

(2) Prove that {(aAe)-*}' = (a"*)' (A-')' (c"')' 
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[8.II Inversion of a matrix 

In Scheme 3 of Section 17.5 vve showed how to solve the equations 

«31-'V + H- «33^= - • ( 18 - 57 ) 

where the 0^ are known numbers, the x, j, and z are unknowns, and 
M, and w are general symbols. If the determinant of the is not 
zero these equations have a unique solution, which can be written in 
the form 

a: = Cii « + ^12^ + ^13 a' 
y = e^i it + ^22 ^ + ^23 ^ 

^ = ^31 M + ^32 W + £33 W . . (18.18) 

where the Crg are numbers which we find in the course of the compu¬ 
tation. These equations can be written in matrix form 

ax' — 11 '; x' = eu' 

where x' stands for the column vector [.%*,>', z]', and u for [«, v, tv]'. But 

we know that the solution of ax' = «' is //' = x'. So e is the 

reciprocal matrix a~^. This gives us a practical method of finding the 
reciprocal of any given (non-singular) matrix. 

For theoretical purposes we turn to the properties of co-factors dis¬ 
cussed in Section 17.18, and in particular to equations (17.37). In the 
present notation these can be written 

.V == {Ai^u A.iV + AaiW^A 
y ^ {Ai^n A.^v + A 32 P)jA 

z = {AisU -{- A.zV A^:iZo)IA 

where Arg denotes the co-factor of a^g in J, the determinant of a. (Our 
symbols u, r, zv are equivalent to /zj, /^g, of equation i7'37-) ^ 5 " ^f ^ 
is the matrix of co-factors A^^^ x' = {A'u')IA. But this must be equi¬ 
valent to the solution x' = a~^ u'; so 

a~^ = A'IA . . . (18.19) 

Since the co-factors A^ are defined by means of certain determinants 
obtained from the matrix a, this is in effect an explicit expression for 
the reciprocal a“'. 

18.12 Multiplication of determinants 

The determinant A which has elements a,, the first column, 

ft,, fto, fta in the second column, and Ci, tg* ^3 iri the third, is defined as 
^ da ft^ Cy, where the €a^y are certain numbers taking only the 
values — I, o, and i. This in effect is our definition of a determinant: 
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the epsilon symbol is merely put in to give the correct sign to the pro¬ 
duct Now in double suffix notation this definition becomes 

^ = ^ fI < 7 ^ 2 3 • • • (18.20) 

a.^,T 

and a similar definition holds for matrices of higher order. In the same 
way the determinant 

^ir 

^rst ^2r ^2s ^Sf 

^ 3 r ^ 3 b ^ 3 t 

= ^ Qar a^s a^t . . . (l8.2l) 

almost as a matter of definition. Here ^rst means the determinant whose 
first column is equal to the rih column of the original determinant J, and 
whose second and third columns are respectively the 5th and fth columns 
of A. Thus 

^123 . . . (18.22) 

and if we interchange any two of the suffixes r, s and /, we interchange 
the two corresponding columns in and therefore change its si<yn 
(Section 17.14). 

Afgi ^art — Agif, etc. . . . 

Consider first the case when r. s, and t are all different. They must 
then be the numbers i, 2, and 3 arranged in some order: so that €rst 
= ±1. We also have shown in Section 17.14 that interchange of two 
suffixes in an epsilon symbol changes the sign of the symbol: = 

^ert — ^tc. Tlic quotient A^gt/^rti will accordingly be unaltered 
by an interchange of two suffixes: A^g^j^.g, = = 

etc. Now the suffixes r, 5, t are simply the three numbers i, 2, and 3, 
though possibly occurring in a different order. By a sufficient number 
of interchanges we can restore them to the natural order 123 (e.g. if 
we start with r 5 / = 2 3 i we can interchange 5 and t, getting r i s = 
2 I 3, and then interchange r and t, getting t r s = 1 2 So d .!€ . 
='di 23 /ci 23 = d/i (by 18.22) = J, i.e. 

Afgi — A . €^g^ , . _ (18.23) 

The other case which may occur is that in which two of the suffixes 

r, s, t are equal. But in that case A^,^ is a determinant with two equal 

columns, and is therefore zero; and = o by definition whenever 

two suffixes are equal. So (18.23) is true in all cases; that is, for all 
values of r, 5, t, 

-2^ fa;?;-tfar= (det a) . . (*8.24) 

Now let a and b be any two square matrices of three rows and columns • 

let c be the product ab, so that r,. = 6 a,. Then the determinant 

01 c IS by definition 
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det (ai) = det c = ^ ^/92 ^rs 

= S ^Ai) (-^Vz) 

Oy^.X A V 

= S ^oA ^al ^/*2 ^•’z] 

= S S ^tiv • ^Ai ^^2 

A.^.k a./ 9 .r 

= ^ [det ti . cA/ir ^Ai ^1^2 ^•'sl (using 18.24) 

\,H,V 

= det a. det 6 .. - • • (18.25) 

The determinant of the product of two square matrices is the product oj 
their determinants. 

The proof we have given for 3 X 3 matrices readily generalizes to 
arbitrary order n. 

Corollary i—The unit matrix I has determinant i; for since Ix = x 
for any matrix x, det x — det (/r) = det I det x. If we choose for x 
any matrix with non-zero determinant this shows that det /== i. 

Alternatively it is evident from the definition of a determinant that 
the only non-zero term in det I is the product of the diagonal elements, 
which is I. 

Corollary 2—The determinant of the reciprocal a~^ is the reci¬ 
procal of the determinant of a. For from a~^ a = I we have (by 18.25) 
det (a-^) . det a — det /= i. 

Corollary 3—A singular matrix a has no reciprocal a“h For det a 

o; but if there was a reciprocal a“' it would have a finite deter¬ 
minant det (a"^), and this is inconsistent with the equation of corollary 2, 
det (a“^). det a = i. 

18.13 Latent roots and latent vectors 

In the Wright-Haldane-Fisher theory of inbreeding, in Fisher’s 
theory of discrimination, and in Hotelling’s “canonical correlations we 
meet with the following problem. Given a square matrix a can we find 
a non-zero column vector v' with the property that multiplication of v 
by a is equivalent to multiplication of v' by an ordinary number A? 
That is, can we find v' and A such that 

av’ = Xv' . . . (18.26) 

(It is clearly essential to specify that v' ^ O', for when v' = O any 
value of A satisfies this equation.) Such a number A is called (by differ¬ 
ent writers) a “latent root”, “eigen value”, “characteristic root , 
“characteristic value” or “proper value” of the matrix a, and the vector 
v' is called the corresponding latent vector, eigen vector, or character¬ 
istic or proper vector. The problem also occurs in some applications m 
a more general form; if b and c are two square matrices with the same 
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number of rows, can we find a number A and a non-zero vector v' such 
that 

bv' = \cv' . . . (18.27) 

If c is non-singular (as is always the case in practical applications) 
the equation (18.27) t)e reduced to (18.26). It is only necessar>' to 
multiply both sides of the equation by c~^; 

c~^ bv' = Ac“* cv' = A/v' = Av' 

so that if we put a = c~^b then v' is a latent vector of a and A is the 
corresponding latent root. 

We shall therefore consider this problem: given a matrix a, can we 
find A and v' ^ O' to satisfy equation (18.26), av' = Av'? A complete 
solution can always be obtained by the following method. Since Iv' 
= v' the equation can equally well be written av' = A/v’, or 

{M — a)v' = 0 ' . . . (18.28) 

We can deduce from this that the matrix (A/ — a) has zero determinant 
(see Section 17.17). For if it had not, there would be an inverse matrix 
(A/ — a)“h and (18.28) could be multiplied through by this inverse to 
give v' = (XI — a)“* O' = O', contradicting the assumption v' ^ O'. 
Thus if a is an n x ft matrix we see that it must be true that 

det(A/— «)= (A—flu) — a, 2 — ajs ... —j = o (18.29) 

^21 ('^ ^22) ^23 • • • —^ 2 n ; 

^31 ^32 (A — 033). . . —< 23 „ 

••• ••• »*• 

^nl 2 ^n3 • • • (A —^nn) 

If this determinant is written out in full this becomes an equation for 
the unknown “latent root” A, which can be solved by any convenient 
method to give all possible values of A. If A, is any such value we can 
certainly find at least one non-zero vector v' such that (A,/ — a)v' = O'. 
For we have merely to solve the equations (A,/ — a)v' — O'. This is 
merely a shorthand way of writingsimultaneous equations in the « 
unknown quantities [vi, v^, . . . the components of the vector v; 
they can be completely solved by the methods of Section 17.4. Since 
the determinant det (Aj/ — a) is zero these equations must have either 
no solution or an infinite number: but they certainly have one solution 
v’ = O', and so they have an infinite number of other solutions, each 
of which is a latent vector v corresponding to the latent root Aj. 


EXAMPLES 

(i) Find the latent roots and vectors of the matrix a = 
(considered in the theory of inbreeding, Section ig.ii). 

MM 
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The equation (18.29) for A becomes 

A- I o 

o A — i o =0 

o — i A — I 

This becomes (A - i) (A - i) (A - i) = o when written out in 
full, since all other terms in the determinant are zero. Thus there are 
two latent roots, Ai = ^ and A2 — i* First consider the root Aj -J. 
the equation (A/ - a)v' = O' for the latent vector v' becomes accor¬ 
dingly 

„ I o ”1 rv~ 

o O 

O -i i - I J L«'2- 

or in expanded form 

— ii’l — il>2 + OV3 = o] 

+ 0^2 + 0^3 “ O ^ ■ ■ • (^^*30) 

OVi — Jt’2 — i ^'3 = o) 

T*he solution of these equations is evidently Vi = /?, 1^2 — -~2k, ©3 = 
k, where k can be chosen arbitrarily. Thus the general latent vector 
»»'(]) corresponding to the latent root A| = i is V(]) = [^, 2ft, «]. 
If we take the second latent root Aj = i the equations for v' become 

0^1 — H's + OD3 = o| 

o?;i + \V2 + OV3 = Or 

OVi — IV2 + 0V3 = o) 

and the general solution is = /, zj 2 == o, 1^3 = where / and m can 
be chosen arbitrarily. Thus the latent vectors »’(2)^ corresponding to 
the second latent root Ao = i are of the form [/, 0, ffi] . 

(2) Find the latent roots and vectors of the “diagonal” matrix 



where all the diagonal elements . . . S„ are assumed to be un¬ 

equal, and all the elements not on the diagonal are zero. 

The determinant det (A/ — D) becomes 


(A - Si) 

o 


(A-S2) 



I 


o 


o 


• • • 
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that is, (A Sj) (A — Sg) . . . (A — S„) = o. There are accordingly n 
different latent roots. Aj = S^, Aj = Sg, Ag = S3, . . . A„ = S„. To 
find the latent vectors v corresponding to the latent root Aj = Sj it is 
necessary to solve the equations (5j/ — D)v' = 0\ i.e., 

OVi + OV2 + OV3 + . . . + ov„ = o 

0^1 + (Si — Sa)^2 + ovs + . . . + ovn = o 

0^1+ + . . . + (Sj — B„)v„ = o. 


Since (S, — S2), ( 5 , — S3), . . . (Sj — S„) are by hypothesis all different 
from zero it follows that V2 = . . . = v„ = o, while t;, = k, say. 

and can be chosen arbitrarily. Thus the latent vector is of the form 
[/j, 0,0, . . ., o]. 

Similarly the general latent vector corresponding to the latent root 
A2 = S2 is [o, A, o, . . ., o], and so on. 

This method of finding the latent roots and vectors can always be 
applied in theory. A method which is often more convenient for 
numerical computation is described later (Section 18.15). 

Besides the latent column vectors v' of a matri.x a it is also possible 
to define “latent row vectors” u satisfying the equation ua = ^u, or 

u{M — a) = 0 . . (18.31) 

The first step in solving this equation will again be to sec that 
det (A/ — a) = o. This is the same equation as (18.29); so the latent 
roots associated with the row vectors are the same as for the column 

vectors. When A has been determined, u can be found by solving 
(18.31). ^ 

By transposition of each side of (18.31) we find, using (18.12) 

(A/' — fl') u' = O' 

or, since /' = /, 

(A/ —fl')„' = O'. 

Thus if « is a latent row vector of a then «' is a latent column vector of 

a, and conversely. In particular if a is symmetrical, that is a — a 

(by definition), the latent row vectors are merely the latent column vec¬ 
tors transposed. 


PROBLEMS 

(1) Find the latent roots, column and row vectors for the matrix 

[;;]. 

(2) Find the latent row vectors of the matrix a of example (1) above. 
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{3) What happens in the case of the diagonal matrix D of example 
(2) when not all the elements are unequal? 

(4) Show that det a — o if and only if o is a latent root. 


18.14 Properties of latent vectors 

Let the equation (18.29) be written out in full; it will become 

A” +/»n-i A"-' + . . . +i>iA^ +i>o = o . . (18.32) 

where 2, . ■ 0 are rather complicated expressions containing 

the elements of the matrix a. For the term A'‘ can occur only in the 
product of the diagonal elements, and all other terms will contain lower 
powers of A. This equation is called the “characteristic equation" of 
the matrix a. In Section 15.5 it was shown that every such equation 
has at least one root, possibly complex, and as a rule it will have n roots. 
Thus any (« X n) matrix will have at least one latent root, and corres¬ 
ponding column and row vectors, and in most cases it will have n latent 
roots. 

Consider now the usual case in which there are n latent roots, say 
A], Aj, . . . A„. Corresponding to each root A^ we can find an infinite 
number of latent vectors; choose one of these, and call it v^r), so that 


a\\r)' = A,.V(r)' . • • (18.33) 


Now write out the vector in full as, say, [Fjr, Vzrt • • ■> ^nr]'- 
all these numbers F,., will form a matrix V whose rth column is the rth 
latent column vector The whole set of equations (18.33) can then 
be neatly written as 



o o . . 

A 2 o • • 

o A ^ • 


o o 



= VL (say) 


(18.34) 


where L is a matrix whose elements are all zero, except on the principal 
diagonal which consists of the latent roots. 

We shall now prove by reductio ad absitrdum that V is non-singular (has 
a non-zero determinant). For suppose othei^vise: then there would be 
a non-zero vector x' such that Vx' — O' (by a similar argument to that 
used in the preceding section to show that there exists a latent vector). 
Of all such vectors x' — [xi, - . . a^n] choose one for which the num¬ 
ber m of components Xr differing from zero is as small as possible: say 
for example that x, o, X2 # o, . . . x^ # o, but all the other x^ s are 
zero. Then the equation Vx' = O' can be written also as 

XiVu) + a:2V(2)' + . . - + x„v^r„) — O' . . (18.35) 
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Since by definition I'd/ ^ O' it follows that m > i. Multiply 
(18.35) throughout by a; we obtain, by (18.33) 

+ •T2^2*'(2>' + . . . + .V,nA„,l',„/ = O' . . (18.36) 
Multiply (18.35) through by A„, and subtract from (18.36): we get 

x,(Ai —A,„)i'd/ +-V2(A2 —A^)i'( 2/ + . . . + :i:^_j(A^_i —A„)i'(^_j/= O' 

the term in V(„/ cancelling out. Now this is an equation of the same 
form as (18.35) containing only (m ~ i) terms; and that is impos¬ 
sible, since m had by hypothesis its smallest possible value. 

When there are fewer than n latent roots, the result may still be true 
in the sense that we can find a non-singular (n X n) matri.x y and an 
{n X n) diagonal matrix L such that aV = VL. Thus for the matrix a 
of example (i) of Section (18.13) have 

nr i o"| ni I o"] ni i o“i ni o o“| 

o I 0 0-2 0 = 0—2 0 0 j ol 

Lo i I J Lo I I J Lo I I J Lo O I J 

« . y = V , £ 

The columns of y are then necessarily latent column vectors of a, and 
the diagonal elements of L are the corresponding latent roots. But the 
proof that this is always possible is no longer valid, and in fact there are 
troublesome matrices a for which Kand L cannot be found. When this 
happens the theory becomes much more complicated. Fortunately it 
seems that this rarely occurs in practice, and we shall here consider only 
cases in which y and L exist, and y is non-singular. ^ 

Since F is non-singular it has an inverse F-*, and therefore from 
(18.34) 

aFF-» = FZ.F-*, i.e. a = F£F-i. 

and therefore 

F-'a = F-‘ yt F-> = LF-i 
Now let the rows of the matrix F"* be in order 

equation (18,38) is equivalent to the w equations 

«(„a = A,«,i), = Aji/ia), . . a = A„ 

The rows «<,) of F"* arc therefore latent row vectors of a. 

The equation (18.37) provides an interesting and important formula 
tor the powers a”' of a matrix a. For we have 

aa yty-^ yty-^ = ytty-^ — fz,* f-i 
= aa^ = VLV~^ VV- F-' = ytL^ F-* = FZ,^ F-^ 

and in general, continuing in this way, 

fl"* = VL ”* F- ‘ 


(18.37) 

(18.38) 


(18.39) 



534 


BIOMATHEMATICS 


Now V and L are matrices obtainable from the original matrix a. 
And L”* is easy to calculate. For a direct multiplication shows that 
X* = XX is a matrix with diagonal elements A2^ . . . and all 
other elements zero, X® = XX^ has diagonal elements Aj®, Ag^, , , 
A„®, and in general 





o 



(18.40) 


This has a further important consequence. The latent roots A^ are 
defined to be the roots of the “characteristic equation” (18.32), so that 


Ar" + Pn-l V-' + . . . Ar +Po = O 


for all values of r. It follows from (18.40) that 

i" +pn-iL"-^ + ... +PoI^O 


and therefore 

fl" 4- pn-i 4- . . . + pia 4* 

= FX" F-i 4-/)n-i FX"-i + . . . 4-pi FX F'* 4-/>oF/F-> 

- F(X'* -r4-... 4-pii +/>oOF-' 

.= VOV-^ = 0 ... (18.41) 

This result is often stated in the form “a matrix satisfies its own 
characteristic equation”. It can be proved to be true for all matrices, 
including the “troublesome” cases in which F and X do not exist, but 
the argument is then rather more subtle. 


18.15 Practical calculation of latent roots and vectors 

The following method [P. A. Samuelson, Proc. Nat. Acad. 29 
(1943), 393] is convenient. Take any column vector arbitrarily; a 
convenient choice is [i, o, o . . . o]'. Calculate by matrix multiplication 
the vectors Xi — ax^'y x^' = ox/, . . . Now by (18.41) 

.V,,' -r Xn. 1 ' 4- . • • 4- />1 Xi' PoXq 

= (o’* +pn-l fl”"* + . • ■ +Pia 4-/>o/)xo' = O. 

Written out in full this is a set of n simultaneous equations for the n 
unknowns. . ./>o- As a rule they can be solved, and we can 
find the values of/>n-2> • • • Po* Occasionally the equations may not 
determine the coefficients pn-\> pn-zy • • • Po uniquely. If so we ^art 

with a new vector Xq (say) = [o, i, o . . . o]' and so obtain some addi¬ 

tional equations. We continue in this way until we have enough equations 
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to determine the values of Pn-2> ■ • • />o- The characteristic 

equation is then 

A” + Pn-l + Pn-z . + /> 0 = o. 

We solve this equation, finding its roots Aj, A,, . . . A„, or at any rate 
as many roots as we are interested in. These are the required latent 
roots. 

For each latent root A,, a corresponding latent column vector \\r)' 
is found as follows. Calculate the numbers 

^n —1 ^» 

^n—i ^r^n —1 Pn — l> 

^n -3 = ~\~Pn-i> • • • 

^0 = +/>!• 

Then = K-iXn-i + K-2Xn-i + • • - -r hoXo- For 

^^(r) ^n — 1 ^Xn — i H“ 2 “h • • • "i“ ^ 0 q 

Xfi ~f“ ^n— 2 Xn~-i “f" • • • ^ qX i 

^ ( Pn — l Xn—i Pn — 2 Xn — 2 • • • P 0 ^ 0 ) 

+ (^r^n-l 4 " 4 “(^r^*n -2 4 " /*« - - 2 4 " • • • 

4 - {K^i 4 -/>i)^o 

= Xn-i 4- '^r^n -2 ->^0-2' 4 - • ■ • 4 “ A^A 1^0'= ■ 

Furthermore, if we take any arbitrary row vector >’o (say [i, o, o]) 
and find —y^a, y2 = y\a, . . . yn~\ hy matrix multipli¬ 
cation, then «(r) = ^n-i A’n-i 4 -/*n- 2 >’n -2 4 " • • • 4 "/'J 0 0 bc a 

latent row vector corresponding to the latent root A,.. 

We can now, if we wish, easily find the matrices V and F“*. V is 
found as the matrix whose columns are the latent column vectors 
•’(2)' • • • ►'(n)' written in order. Let U be the matrix whose rows 
are the latent row vectors «(,), «(2) . . . W(n). Then if the calculations 
are correctly performed the product UV should be a matrix D with all 
elements not on the diagonal zero. Thus UV =■ Z), D-^VV — D~^D 
— / so that = D~^V. 


EXAMPLE 

(i) We set out on p. 536 the calculations for the matrix 



The vector x®' = [i. o, o]' is chosen, and repeated'y multiplied by a to 
give Xx = axo'; X2 — ax/; Xg' — axj'; as we actually perform the 
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Calculation of latent roots and vectors 


2 


a 




— I I 

— I I 

I 1 




I 



I 


o 

I 

X 


Xo = I o 0^0=—* 

Xl = 2 I O = O 

j:2 = 3 I I I 

ATs = 6 32 



1 00 

2 —X I 

3 0 2 


»»(,) = 2 
V(2)= —I 
»’( 3 )= —1 




(ll(,) =20 2 

£/ = Jtf(2) = —I I I 

|«(3) = “I 3 


6 + 3 P 2 +2/>i +Po = o| _ 

3 + />2 + />! = o'-/)2 = -2,/)i = - 2 

2 + />2 =0) 

A3 _ 2A2 - A + 2 = O. A, = 2, A2= I, A3= -I 



computations these will appear as row vectors x„, at], *2. and *3. The 
equation + p,x, + p,x, + poX0 = ° then gives us the three equa- 

tions 

6 +3/>2 +2^)1 +/'o = o, 3+/>2 +/>i = o, 2+;>2 = o 

which we solve to give = —2, pi = —i, po = 2. (The solution in 
this particular case is so simple that we have not set it out, but merely 
stated the answer.) The characteristic equation 

= o then becomes A"* — zA^ - A + 2 = o, which again in this ii^tance 

is easily solved to give the latent roots Aj = 2, 

Using the values of p^, pi and po we calculate values of /I2, hi, and Ao 

for each latent root A,.; hz — i>hi= Xjiz +/'2» ^*0 = + Pi- ■‘Oe 

latent vector »V)' is then obtained as /lo^o' +^iJ^i' ^h^Xz ; but tor 
convenience of calculation we here use the transposed vectore /lo^o + 
hiXi + hzXz obtaining the row vectors V(,), V(2). »'(3) which are the 
transposed latent column vectors. Written together they form the 
transposed matrix V\ Similarly we take an arbitrary row vector ^0 and 
find — yv = A’2- convenience of calculation the matrix a 
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is here written in transposed form a' above the row vector we then 

find the elements of by multiplying each of the rows of a' in turn by 

the vector ^1. Then using the values of Hq, hi, already calculated we 

can find -f hiyz, the latent row vectors. Written 

together these constitute the matrix U. Finally we calculate D = UV, 
^-1 ^ 

An alternative method of evaluating latent roots and vectors is given 
by A. C. Aitken, Proc. Roy. Soc. Edin., 57 (1937), 269. 


18.16 Jacobians 

Let F = F{x, y) be a function of (say) nvo variables x and y. F will 
then have two partial derivatives dFjdx = = F^, and 

dFjdy = Z)y,a.F = F^. Suppose now that x and y are themselves func¬ 
tions of two other variables, say u and v\ then we know that the partial 
derivatives of F with respect to u and v are (Section 9.5): 

+ F„y,, 

Fv = + F^y„ 

But these equations can be expressed in matrix form as 


[F„ F,] = [F^ 

L^u 



(18.42) 


A similar expression holds for functions of three or more variables. 
Thus if G is a function of x, y, and z, each of which is in turn a function 
of u, V, and to, then we shall have 




The matrix 


C?.] = \G, 





•Vp 


L>’« 


(or the corresponding 


matrix for more than 


two variables) is known as the “Jacobian matrix”, and its determinant 
as the “Jacobian of x and y with respect to u and v'\ The determinant 
is usually written as d(x, y)l d(u, v). There seems to be no accepted 
notation for the matrix itself; it might be written as d{x, y)ld{u, v), or 
as D„,„[x, y]. 


E X A M P I. E 


(i) Take X and y to be ordinary cartesian co-ordinates in a plane, 
u and V to be {r, d}, the polar co-ordinates. Then x = r cos d, 
y = r sin 0 , the Jacobian matrix is 




[ cos 0 — rsin 

sin 6 r cos 

The determinant d(x, y)l 8 {r, 0 ) is therefore r(cos 0 )^ + r(sin 0 y = r. 


3 
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These Jacobians have several important properties. Suppose that 
u and V are changed by small amounts 8m, hv respectively. The corres¬ 
ponding changes Bx, By in a:, y respectively are (see Section 9.3): 

Bx ~ x^Bu -h 

By y^Bu +y„Bv . . . (18.43) 

or in matrix form 

rSA:"! ^ r^u r8w"| 

L^yj Lyu yvj 

or [Sat 8y]' [^> ;y] • * • • (* 8 - 44 ) 

which is the analogue of the one-variable relation Sa: ~ D^x . Bu. 

If in turn u and v are functions of variables r and s we shall have 


similarly 

whence 

But 


[Sm Bvy D,,,[u,v].[Br Bs]' 

[Bx ByY - [x. y] . />r.» [«, »] • [Sr Sj]' 

[8 a: ByY ~ [x, >’] . [Sr 85]' 


B [X, y] == />».v [X, y] . ^r., [W. v] 


(18.45) 


the matrix analogue of the one-variable formula DrX — D^x . D^u. By 
the determinant multiplication formula ( 18 . 25 ) ^ similar rule holds for 


the Jacobians 


^(^^v) ^ g(-v. y) ^(». tj) 

f(r, s) e{u, v) d{r, s) 


(18.46) 


In particular if we take r, j to be identical \vith .v, y, the left-hand 
side of equation (18.45) becomes [.v,y]; and on writing this out 
in full it will be seen to be the unit matrix /. It follows that the matrices 
/>u.v [x, r] and [w, v] are inverse to one another. This enables one 
to find the values of My, z'xt given those of at^, a:„, yt,. Thus for 
example by inverting the matrix Dr.e [a:, y] of Example (i) above we 
find the matrix [r, 0]: = cos B, = sin 9 y — r"* sin By 


0 ^ s= r~^ cos B. 

'I'his process of inversion becomes impossible when the determinant 
c (xy v)! e{u, r) = o, and so the matrix [.v, y] is singular. What does 
this mean? In some cases the determinant will be zero accidentally, so 
to speak, for particular values of u and Vy meaning that the inverse matrix 
[u, 1’] happens not to exist for such values (e.g. m* might become 
infinite). In other cases the Jacobian is zero for all values of u and Vy as 
for instance when .v = m v, y = sin u cos v cos u sin v, as the 
reader can readily verify. In such cases it can be shown that a; andy are 
no longer independent, but are connected by a functional relation: in 
the example given, y = sin u cos v + cos m sin v = sin (m + t?) = sin a:. 
Conversely if a* and y are connected by a functional relation, then 
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d{x, y)jd{u, v) = o for all values of u and v. A similar result holds for 
3 or more variables; if a:, z are functions of u, v, zv, there is a func¬ 
tional relation G{x, y, z) = o if and only if d{x, y, z)l d{u, v, w) 
is identically zero. A complete proof of this relation is a little difficult, 
but the following sketch will probably be sufficiently plausible. If 

= o it is possible to find a non-zero vector [a y3] such 
that [a [x.y] = O, and conversely. On multiplying both sides 

of equation (18.44) by [a we find therefore [a yS] [ 8 x Sy]' = o 
for all values of 8 u and Bv, and conversely if this is so [a ^ [a-, y] 

= O. But [a [Sa: Sy]' = aBx -j- ^By by direct matrix multipli¬ 
cation. By hypothesis not both a and ^ are zero: let us say a ^ o. 
Then the relation aBx +^Sy = 0 can be rewritten Bx = —(^/a)Sy. 
This means that when Sy = o, Bx = o also, i.e. when there is no 
change iny there can be no change in x, i.e. wheny is fixed, so is x, i.e. 
a: is a function of y. Conversely if there is a functional relationship 
>') = o tbe relation aBx -h ^By = o holds with a = F, and B = 
Fy since o — BF = F^Bx + FyBy. 


EXAMPLE 

(2) What is the condition that x should be a function of u 

Writey — u — v; the condition is then that the Jacobian 

= o. But y^ = I, y'v = — i. and so this becomes x,, -j- x^. 
Section 10.10). 



•Vu a:,, 

^ u yv 
= o (see 


Another use of a Jacobian is in the change of variable in a multiple 
integral. We showed in Section 16.13 that an integral over an area, 

f a dA, can be written either as J J" a dx dy in cartesian co-ordinates, 

or as <T r dr in polars. The general formula is 


J y* a dx dy = J J a 


^x^y) 

e(u, v) 


du dv 


(18.47) 


(and similarly for a triple or higher integral) which agrees with our result 
since (by Example (i) above) e{x, y)/ 5 (r, 0 ) = r. The general result 
can be proved by changing variables one at a time, since 


y) ^ ^(x, y) 8 (x,y)^ 

d{ii,v) d(x,v) d{u/v)* 

and it will be found that if we change the variables from {x, y) to say 
{x,v), the Jacobian e(x, y)l d(x, v) becomes simply y„,,. and the for¬ 
mula (18.47) becomes simply that for the change of a single variable 
But there is always the problem of relating the correct range of values of 
1/ and to those of x and y, and it is often best and safest to proceed by 
changing variables one at a time rather than to use (18.47). 
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19.1 Mathematical probability 

This world is notoriously a place of uncertainties. Some things may 
be certain or nearly so: few people would have doubts that 24-2 = 4, 
or that the earth is round, or that the blood circulates throughout the 
body. But for the most part our beliefs are tempered with doubt, and 
the most we can say of a future event is that it is likely or unlikely. And 
what is worse, no two persons will agree on how likely it is. One feels 
perhaps that they ought to agree, and some mathematicians have tried 
to construct rules of likeliness and unlikeliness which any reasonable 
person should obey. But, perhaps because we are none of us reasori- 
able, no such theories have so far gained any wide acceptance; and Ukeli- 
ness is a quality which must be considered as being outside quantitative 
measurement. 

However there are a few cases in which we can find a numerical 
measure of probability—and these cases, though few in number, are 
immensely important in practice. We shall begin with the simple 
example of the tossing of a coin. We know that if we spin a penny 
many times it will come down heads as often as tails. Common sense 
suggests that this is so, and a large number of experiments have con¬ 
firmed it. It is true that no real coin is absolutely unbiassed, but the 
difference from equality in heads and tails is inappreciable for most 
practical purposes. If we take a playing card from a well shuffled pack, 
replace it, shuffle again and repeat the experiment an indefinitely large 
number of times, we expect that one card in every four will be a dia¬ 
mond, and one in every thirteen a queen. These of course are average 
frequencies observed in the long run. When an event occurs in a certain 
fixed proportion of cases in the long run, we call this proportion the 
“probability” of its occurrence, and say that there is a statistical 
regularity. Thus in tossing a coin the probability of obtaining heads is^ 
since heads will occur once out of every two throws. The probability 
of drawing a queen from a well-shuffled pack is xV • probability of 
drawing a diamond is 

More precisely we may define a probability thus. Suppose an event 
E can be followed by an event an event A 2, or an event .^4 3; that is, 
it must be followed by one and only one of these three events. Suppose 
further that in the long run it is followed by A^ in a proportion p of 
occurrences, by .<4 2 in a proportion of q, and by ^3 in a proportion r: 
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then p is called the probability of given E, and written Pr{A^\E)\ 
q is called the probability of ^2 given E, and r the probability of ^3 
given E. We have chosen here the case of three possible subsequent 
events, but of course there is no special reason for this except for the 
sake of illustration. There could be two, four, five, six or any other 
number of possible subsequent events, and the same definition would 
hold. But since in any case some one of the e^'ents must always follow E, 
the sum of all the probabilities must be unity; 

/> + 9 + r = I . . . (19.1) 

For example, the event E might be the spin of a coin. The possible 
subsequent events might be A^, the appearance of heads, and A 2, the 
appearance of tails, with probabilities p and q respectively. Then p = 
or very nearly and q = i, so that p q = i. If £: is' the withdrawal 
of a playing card from a pack, the outcomes could be classified as . 4 i, 
spades, A 2, hearts, A 3, clubs, and A „ diamonds, each with probability ]. 
In such cases the probability is an exact simple fraction. But that is not 
always the case. If E denotes the birth of a child in London, there is a 
certain probability p oi A^^ that it is a girl, and a certain probability q 
of A 2, that it is a boy. Here p is approximately 485, since 48 5 per cent 
of births are of the female sex. 

There are several points to notice in connection with this definition. 
The first is that it involves repetition. Sometimes the repetition may be 
imaginary. An experimenter may be satisfied, from his common sense 
and general experience, that if he repeats an experiment a large number 
of times he will get a result A in a certain proportion p of cases. But 
he will not as a rule want to repeat the experiment at all, or at least not 
very often. Here the probability p represents what would happen if he 
did repeat the experiment. But something which cannot be repeated 
does not have a probability. We do not speak of the probability that 
the Statue of Liberty is 184 metres high: it either is 184 metres high, 
or it is not, and that is an end to the matter. Similarly, and most im¬ 
portantly, we never speak of the probability of a hypothesis being true. 
It either is true, or it is not: it does not fluctuate between being some¬ 
times true and sometimes false. Of course we can always say that a 
hypothesis is probable or improbable in the everyday sense of these 
words: but we cannot speak of its having a probability p in the tech¬ 
nical sense. 

It is also important to have a clear definition of the events E and Ar 
concerned in the definition. Thus if we want to know the probability 
of an experiment E giving a result A^ we must specify exactly what we 
mean by “i?”; or what comes to the same thing, what we would do if 
we were called on to repeat the experiment. 'I'hus it is not sufficient to 
ask “what is the probability of being short-sighted?” We can ask 
“What is the probability of a man of age 37 chosen at random from the 
population of Glasgow suffering from myopia?” provided that we have 
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an exact definition of myopia: we can also ask “What is the probability 
of a man over 21 chosen at random out of England and Wales being 
myopic?’* We have referred to E, Ai, A2, and Aq as events: it might 
be more accurate to speak of them as “kinds of events’* in a very 
general sense of the word “event**. All we require for our purposes is 
that we shall be able to say in any given situation whether an event of 
kind E has happened or not, and for our purpose an event may be defined 
as “something which does or does not happen**. 

We also meet here with the word “random**. To choose a person 
“at random’* from a population means to choose in such a way that any 
one person is as likely to be selected as any other. It is a phrase whose 
meaning is readily understood, but which is difficult to define in precise 
terms. It should be well noted that perfect randomness is surprisingly 
difficult to obtain in practice. Investigations show for example that a 
pack of cards is usually very badly shuffled. And if a person is asked to 
make a random sample of wheat plants in a field or pedestrians in a 
street, using no other guide than his feelings as to what is random, the 
result will inevitably be a heavily biassed sample. Thus the greatest 
difficulty in applying the theory of probability in practice is that of 
making sure that we really have true randomness. For if that is not so, 
any subsequent conclusions will be invalid, however accurately the 
calculations are performed. 

The word “chance” is often used instead of “probability”, and is 
(in its technical sense) a synonym: a “chance of i” means the same as 
a probability Frequently too we speak of the “probability of an event 
A'\ instead of the more accurate term “the probability of A given E”. 
This is a convenient practice, since the nature of the antecedent event E 
may be fairly obvious; e.g. we can speak of the probability of a newly 
born child being a girl, E being the event of a birth occurring. Some¬ 
times too the antecedent event E may be the same throughout the whole 
of a calculation, and one will not want to refer to it continually. But 
there are dangers in the omission of the mention of E, and it is important 
to keep the exact definition of E in mind, even when it does not occur 
explicitly in the symbolism. 

'I'he adjective “stochastic”, which is frequently used, means “per¬ 
taining to chance”, or “random” (Greek stokhdstikos, conjectural). 

We should also note that some writers use a different sense of the 
word “probability”, e.g. Jeffreys* “inverse probability” and Fisher’s 
“fiducial probability”. The use we have given above is the one which 
is generally accepted by statisticians—as shown by their actions, if not 
always by their words. This is not intended to prejudge the usefulness 
or otherwise of different definitions of probability; the reader can study 
tliem elsewhere and form his own opinions. Our definition is simply 
the most usual one, as well as being practical. To go outside it would be 
to enter unnecessarily into controversial topics, without doing justice to 
the issues involved. 
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X9.2 Negation of an event 

If the probability of an event occurring (given E) is p, and the 
probability of it not occurring (given E) is q, then/) + j = 1, for either 

or not--/ii must occur. So ^ = i — p, or 

Pr (not-v4i|£) = I — Pr(A,\E) 

If an event always occurs, its probability is i; if it never occurs its 
probability is o. However we cannot quite say that a probability of o is 
equivalent to impossibility. For presumably there is a probability o 
that a man selected at random will have a height exactly 17 metres, or 
any specified height h that we wish. We should never expect this to 
occur in practice, although we might find men with heights exceedingly 
near 1*7 m. But we cannot say that it is impossible, or we should be 
driven to the conclusion that it is impossible for a man to have a height. 
The reason for this paradox evidently lies in our idealization of the 
measurement of height: for convenience we imagine heights as meas¬ 
ured with complete accuracy, i.e. as unending decimals, although in 
practice only a limited accuracy can be achieved. Having made this 
idealization we are forced to allow a second one, that the attainment of 
any one particular height is infinitely improbable, but not impossible. 
Such an event is sometimes said to be “almost impossible”. 

19.3 Odds 

If (given E) an event A^ happens three times as often as an event A^ 
we say that the odds are three to one (or 3 : i) on , (against A^. When 
AI and A 2 are the only possible outcomes this means that Ay occurs 
three times out of four, and A 2 once; the probability of A^ given E is 
therefore |. More generally if an event E has a number of possible out¬ 
comes Ai, A2, A3 . . . A„, and in the long run for every w, times that A^ 
occurs, occurs Wg times, A3 times, and so on, then A^, A2, . . . A„ 
have the odds or relative probabilities : Ug : U3 : given E. 

Since this means that Aj occurs on the average times out of every 
(«i + 1^2 4 " ■ + ^n)t this is equivalent to saying that the 

probability of (given F) is «,/(«, + 1/2 + “3 4 - . . . -f u„). But often 
it is more convenient to use odds: thus we speak in genetics of a 3 : 1 
ratio, meaning that we get three dominants to every one recessive. 
Alternatively we can say that the probability of a dominant is J. Clearly 
if the odds on an event Ai are m : i, the odds on not-/ 4 j are i : u, or 

: I. 

X9.4 Addition of probabilities 

The chance of drawing a diamond from a pack of cards is i, the 
chance of drawing a heart is J. So the chance of drawing a red card 
must be i 4 - i = i- For out of every four cards drawn on the average 
one is a diamond, one is a heart, and so i 4- i = 2 are red cards, i.e. 
the probability of drawing a red card is (i 4- i)/4 = ^. 
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More generally if ^2. ^3 • • • are (mutually exclusive) possible 
outcomes of an event with probabilities Pz • • - r^pectively, 

then the probability of an outcome being either Ai or A2 Pi +/*2» 
and the probability of it being either or ^2 or is pi -f P2 +p3* 
For this probability is simply the proportion of cases in which we get 
the outcomes or ^2 or ^3, and since A^y A^t and A^ are by hypo¬ 
thesis mutually exclusive events (i.e. no two can happen at the same time) 
the proportion in which at least one of the three possibilities occurs is the 
sum of the proportions in which they occur separately. 

Pr{Ay or ^2 or ^3!^) = Pr{A^\E) + Pr{A2\E) + Pr{A^\E) . . (19.2) 

If the relative probabilities or odds of A ly A 2, A 3, etc. are in the ratio 
«! ; M2 : M3 . . . then the odds of (.< 4 i or .(^2), .^3, .^4 • - • (mj -|- Mg) 

: M3: M4. . . 

Note that it is essential for the events to be mutually exclusive. If 
the probability of a person selected at random being male is of being 
right-handed is (say) and of being in good health is 1%, it does not 
follow that the probability of being either male or right-handed or in 
good health is ^ + 4 + tu = 2*26! 

19.5 Multiplication of independent probabilities 

Suppose we toss a coin and at the same time select a card at random 
from a pack. Then in one case out of every two that we do this the coin 
will fall heads: that is, on four occasions out of eight. On one of these 
four occasions the card selected will be a diamond: so we obtain both 
heads and a diamond on one occasion in eight. That is, the chance of 
the combined event is ^ i X and is obtained by multiplying the 
chances of the separate events. 

This holds in general. Suppose an event E is followed by a conse¬ 
quence .^1 in a proportion pi of cases, and an event P is followed by 
consequence Pi in a proportion Pi. Consider the consequences of the 
joint occurrence of E and P. If it is true that the events Ai and Bi are 
“independent”, that is, if the occurrence or non-occurrence of Ai has 
no influence on the chances of jBj happening, then out of the fraction 
Pi of cases in which Ai occurs, a further fraction Pi will be occurrences 
of Pj, i.e. a proportion PiPi of the whole. This is illustrated geometri¬ 
cally in Fig. 19.1. At the fashionable supper parties organized by a 
certain not very well-known society hostess guests are approached and 
asked to draw a card from a pack and a counter from a bag containing 
one white, two green, and three yellow counters. If they draw a yellow 
counter and a red card they are asked to recite; if they draw a green 
counter and a spade they are required to sing, while those drawing a 
white counter and a club must assist the conjurer. To represent this we 
take a square of unit side and divide it into three strips of width 11, §, and 
iy respectively. The areas of these strips represent the chances of 
drawing a yellow, green, and white counter. Now the drawing of the 
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2/4 

Red 


1/4 

Spade 


1/4 

CluO 


Fig. 19,1—Multiplication of independent probabilities 

yellow counter may be combined with that of a red card, a spade, or a 
club: to show these three possibilities we divide the strip into hori¬ 
zontal strips of height if, and \ respectively: the corresponding areas 
show the chances of the respective combinations. Thus the combin¬ 
ation of a yellow counter and a red card occurs in a fraction i of the first 
vertical strip, i.c. in a fraction ^ x i of the whole area. This can also 
be seen by dividing the square into twenty-four equal small rectangles, 
each representing a combination of a particular suit and a particular 
counter. Each of these combinations has equal probability ttV: and 
since there are six of them in the rectangle corresponding to the com¬ 
bination of a red card and a yellow counter, it has probability 6/24 = 1/4. 
Similarly a green counter with a spade has probability 2/24 = 1/12; and 
a white counter and a club, 1/24. 

It should be noted that this product rule is not so much a theorem 
as a definition: to say that events are statistically independent is to 
assert that the product rule for probabilities is satisfied, and strictly 
speaking can only be tested experimentally. But in many cases it will 
be fairly obvious that events are, or are not independent. Thus the num¬ 
ber of mice in a litter is unlikely to have any relation to the day of the 
week on which they are born, but will not be independent of the genes 
carried by the parents. Incidentally there is a common fallacy that 
successive throws of a coin are not independent: if a coin has come 
heads several times running the “law of averages” is said to predispose 
It to come tails the next time. No such effect occurs when the tossing is 
properly done, and the “law of averages” is a mystery unrevealed to 
mere mathematical statisticians. 
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Note that the events E and F referred to in the definition of indepen¬ 
dence need not be distinct. E might be the birth of a child, for which 
there is a certain probability of being male and a certain probability of 
being of blood-group O. These events are practically independent: the 
percentage of males of group O is the same as the percentage of females, 
or very nearly so. So we can find the chance that a baby will have male 
sex and group O by multiplying the chances of maleness and of O group. 

19.6 Genetical applications 

Most living organisms are diploid, that is, they carry in every cell a 
set of chromosomes, and the chromosomes of each set occur in pairs. 
These chromosomes carry the “genes’* or units of heredity: and these 
genes will likewise go in pairs. Thus in men there are two genes' M and 
N which determine the so-called MN blood-groups; and any individual 
will carry two such genes. They may be either both iV/, so that we can 
write the genetic formula or “genotype” of the individual as TVfM, or 
they may both be AT, or one may be M and the other N, giving genotype 
MN. These three genotypes can be readily distinguished by tests on 
the blood, and correspond to three separate blood-groups, MMy MN^ 
and NN. (The group MM is often more briefly written M, and the 
group NN as N\ but the correct formulas from the genetic point of 
view are MM and NN.) Now a parent can hand down only one of his 
two genes to any child: but which one he hands down seems to be 
entirely a matter of chance. Thus a parent of group MM must hand 
down a gene M, and one of type NN must hand down an N, but one of 
type MN is equally likely to hand down an M or an AT gene. From this 
we can readily calculate the chances for the offspring of any particular 
mating. If both father and mother are MNy each of them has a chance 
^ of passing on an M gene, and i of passing on an N. So there are four 
possibilities— 


Gene from 
father 

Gene from 
mother 

Genotype of 
offspring 

M 

M 

MM 

M 

N 

MN 

N 

M 

MN 

N 

N 

NN 


By the multiplication rule each of these has chance | X i — i- But the 
argument shows that there are two ways of obtaining an MN child: we 
can combine these according to the addition rule, giving a total chance 
j X J i of the child being MNy i of it being MM and i of it being 

NN. " .... r 1 

Other situations can be dealt with in a similar way: for example, a 
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mating MN X NN will give on the average equal numbers of MN and 
NN children. 

A further complication is caused by the phenomenon of recessivity. 
Mendel in his classic experiments on peas found that there were two 
genes Y and_y responsible for the colour of peas. But while the combin¬ 
ation yy produced a green pea, both of the combinations Yy and YY 
gave a yellow one. We can say that the gene y produces no obser\'able 
effect when Y is also present, or that it is “recessiv'e” to Y, and that Y 
is “dominant” tojy. In such a case the mating Yy x Yy still gives the 
three types YY, Yy, and yy in the correct proportions of h, and 
but the first two types are no longer distinguishable by observation. So 
the total probability of a yellow pea from such a mating is the proba¬ 
bility of either YY or Yy, which by the addition rule is j -}- i = |. In 
the long run we shall find three yellow peas for each green. Repetitions 
of Mendel’s experiments by various investigators have given a total of 
137,407 yellow and 44,692 green peas, or 75 09 per cent yellow and 
24 91 per cent green. Such recessivity is quite common in genetics. In 
the human A-B-O blood-groups we can distinguish at least four 
genes A^, A^y B, O. Here O is recessive to all the others, so that the 
combinations A^O and AjAi are indistinguishable and constitute 
group Ai, the combinations A and A2A2 both give group Ao, and 
OB and BB both give group B. Furthermore A 2 is recessive to A^, so 
that the genotype AzA^ is classified as of blood-group A^. The geno¬ 
type 00 corresponds to blood-group O. By using these facts we can 
readily determine the chance of any blood-group arising from any given 
mating. For instance the mating Ox A^B OO x A ^B will give 
children of genotypes OA^ and OB with equal probabilities: and they 
will have blood-groups Ai and B respectiv'ely. 

19.7 The probability of successive events 

The multiplication theorem for probabilities can be generalized in 
the following way. Suppose that E is an event which has a number of 
possible consequences z 3 ,, A 2, . • • A,„ with respective probabilities 

pn- Suppose further that A^ has in turn various consequences 
Bz, . . . with probabilities /*,, Pj, . . . P,„ respectively. Then 
the probability that E will be followed by A^ and then A, by /i, is 
For in a proportion />i of cases E will be followed by Aj ; and in a pro¬ 
portion P, of the cases in which A^ occurs, it will be followed by B^. 
But this is a proportion PiBi of all the cases. 

We can write this most generally as 

Pr (A, Of B,IE) Pr (A,IE) Pr (B,IA, & E) . (19.3) 

but in many applications the probability that A^ is succeeded by Bi 
will not be affected by the previous event E, and we obtain simply 

Pr (A, & B,\E) = Pr Pr (B,\A,). 
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EXAMPLE 

(i) Edward, of group MN, marries Ethel, also of group MN. They 
have a child Alice, of unknown group, who marries Andrew, of group 
MN, They in turn have a child Betty. What are the chances of the 
various combinations of blood-groups for Alice and Betty? The calcu¬ 
lations are shown diagrammatically in Fig. 19.2. Alice has a chance i 


ALICE 

MM MN NN 



Fig. 19.2—The product law for probabilities of successive events 

of being MM, i of being MA- and | of being A’.V. We take a unit square 
and divide it into three strips of width i, and ^ respectively: the 
areas of these strips represent the proportions of cases in which the three 
genotypes occur. Now if Alice is MM, the mating of Alice and Andrew, 
being MM >. d/.V, gives Belly a chance of being MM and h of being 
MN. We therefore divide the first strip into two equal parts, represent¬ 
ing the cases in which Alice is AIM and Bettv AIAI, and those in which 
Alice is MM and Betty MA\ Each of these combinations therefore has 
probability ] X A = Similarly if - 41 ice is ALX her mating with 
.Andrew gives Bettv a chance :} of being AIAI, § of AI.\, ^ of A.V, and 
we tlierefore divide the central strip into three parts in the ratio ] : ^ : i- 
So the combinations Alice AIN and Betty AIAI or AW each have chance 
i X ] - - i : and the combination Alice AIN and Betty AIN has chance 
i X ^ “ }. In the same way we divide the last strip into two parts, 
giving equal probabilities } X i == J for the combinations Alice AW, 
Betty AIN and Alice AW, Betty AW. 

Suppose now we wish to find the chances for Betty irrespective of 
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Alice’s genotype. We can express the event “Betty is in the form 

“either Alice is AIM and Betty is MN 
or Alice is MN and Betty is MN 
or Alice is NN and Betty is A/iV” 

and these are three mutually exclusive events. So the probabilitv that 
Betty is MNy given the original genotypes of Edward, Edith and 
Andrew, is the sum of the chances of the three events specified above, 
i.e. i + i H- 1 Alternatively we can add all the areas in Fig. 19.2 

corresponding to the cases Betty = MN. This is quite readily done if 
we imagine the area divided into sixteen equal small squares: the parts 
of the diagram marked MN contain eight of these small squares, giving 
a probability 8/16 = 1/2. Similarly we find that Betty has a chance J 
of being MM and ^ of being NN. 

19.8 Conditional probability; selection 

As we have seen, a mating of two pea plants of formula Yy, or a self- 
fertilization of a Yy plant, gives three yellow peas to every green one. 
Now these yellow peas, although all alike in appearance, fall into nvo 
types genetically, YY and Yy, and as Mendel discovered, these types 
can be distinguished by further breeding experiments. Thus we can 
ask: supposing we select at random a yellow pea from the progeny of a 
Yy X Yy mating, what is the chance that it will be YY? Now we know 
that in the complete progeny, including the green peas, the genotypes 
YY, Yy, yy occur in the ratios 1:2:1. If only the yellow peas are 
considered the genotypes YY and Yy must therefore still occur in the 
ratio I : 2, so that the probability of a yellow pea being YY is J, and 
that of it being Yy is §. 

T’hese probabilities can also be evaluated in another way. The yel¬ 
low peas form J of the whole progeny, and the YY peas, which are all 
yellow, arc ] of the whole. Therefore they constitute a fraction j/J == 
of the yellow peas. 

A similar slate of affairs occurs in the breeding of yellow mice. 
Again we have two genes Kandy: but hereyy is normal, Yy is yellow, 
and YY dies before birth. Hence from a mating of two yellow mice, 
Yy X Yy, we shall get progeny in the ratio two Yy yellow, one yy 
normal, the YY class being unrepresented. 

In general, suppose that an event E has possible consequences Ai, 
A^. . . . Suppose further that the classification Ai can be further sub¬ 
divided into sub-classes . . . B„„ each of which has a known 

probability of occurrence, given E. Then the probability of a conse¬ 
quence of E being of kind /i,, when we know that it is of kind A^, is 
obtained by dividing the unconditional probability of B^ given E by the 
probability of A^ given E\ thus 

Pr {H,\E & A,) = Pr (B,\E)IPr(A ,\E) . . (19.4) 



55° 


BIO MATHEMATICS 


This shows that the odds or relative probabilities of the conse¬ 
quences Bi, ^2,... are unaltered by the restriction that we consider 
only those consequences of E which are of type The actual proba¬ 
bilities must be divided by Pr {^Ay\E) to ensure that their sum shall still 
be unity. 

The formula (19.4) is the formula for the “conditional probability” 
of Bi given E and A^. It can be taken as representing the effect of selec¬ 
tion on a population, e.g. of selection of the yellow peas out of the popu¬ 
lation of all progeny of a Yy X Yy mating. It can also be considered 
as showing the effect of additional information on a probability: here, 
the information that a pea selected at random is yellow. Finally it ^n 
be considered as formula (19.3) viewed in reverse: if we divide equation 
(19.3) through by Pr (^i|F) and interchange the right- and left-hand 
sides we obtain 

Pr {B,\A, &E)^Pr {A, & B^\E)jPr {A^\E). 

This is identical with (19.4); for in (19.4) Bi was defined as a sub-class 
of Ai, so that if 5 , occurs Ai must necessarily occur too: “^1 & B,” 
is equivalent to “Bj”. 


EXAMPLE 

(1) Consider the family affairs of Edward, Ethel, Alice, Andrew, and 
Betty, discussed at the end of the last section and illustrated in Fig. 19.2. 
Suppose Betty’s blood is tested and found to be of group MM. What, 
with this additional information, are the respective chances of Alice 
having each of the three groups MMy MN’y and AW? 

From Fig. 19.2 we see that the only cases in which Betty has group 
MM are [Alice AIM, Betty MA^ and [Alice AiN, Betty AIM]: and 
before Betty’s group was determined, these were of probabilities i and 
I respectively, the total probability being So after we know Betty’s 
group to be AJAI the chance of Alice being MM will he m and 
that of being AIN will also be 


These rules of addition, multiplication and division constitute the 
“Calculus of Probabilities”, and any problem in probability theory can 
be solved by their application. The most difficult to apply is the last 
one on conditional probability. Here however the chief difficulty is 
that of keeping the meaning of the operations clearly in mind. Thus 
when we say that the “probability of Alice being MMy given that Betty 
is AIAIy is we mean that if we choose from a very large population 
all those families in which the mother’s parents are both of group MNy 
the father is also of group AIN, and the daughter is MM, then in half 
the cases we shall find that the mother is of group AIM (and in half the 
cases of group AIN). 
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PROBLEMS 

(1) Bill is the son of Arthur and Ada, of known blood-group geno¬ 
types A^B and OO respectively. Bessie is the daughter of Charles and 
Catherine, AiA^ and AyO respectively. If Bill and Bessie marry and 
have a daughter Doreen, what probability has she of having any given 
blood-group.? If on examination she turns out to be OO, what can you 
say about Bill and Bessie? 

(2) Frank, A^A^y and Felicity, A^O, have a daughter Gwendoline, 

who marries George, AiB. What are the possible blood-groups of their 

children, and with what chances? Suppose the first child, Herbert, 

turns out to have group A ^B, what will the chances then be for a second 
child? 

19.9 Random mating 

Consider a population of individuals, say for example, that of Great 
Britain. Each individual will carry genes for any particular character: 
and so can be classified according to his or her genotype. Thus there 
will be a certain proportion of individuals of group MM, a pro¬ 

portion of group MA^ and a proportion of group NN. These 
proportions will be the same for the two sexes: this will be the case for 
most genes except sex-linked genes. 

If the population of Britain is n, there will be pM^n individuals of 
group MM in Britain, p^JNn of group MN, and p^^f^ of group NN. Let 
us conventionally count each individual as contributing two genes: then 

PA/Af« persons of group MM will have between them 2 pM^n M 
genes, the Pmn^ of group N will have p.uv« M genes, and p^jv” A^’s; 
while the pt^j^ individuals of group N will have between them 2 p w 
Altogether there will be {zpMj,j -f- genes M and {pj,^^ d-2P;^.v)'* 

genes N among a total of 2/1 genes altogether. We can accordingly define 
a “gene frequency” p = {zp^tM + PmMzh = p^,,„ + ^p^,^ which 
represents the proportion of M genes among the total population of 
genes. The frequency of N will be ^ = ipjv,;v + Pmm = i - p, since 
PmM + PmN + PNN — t • 

Blood-group investigations in the British population have given the 
percentages 30 per cent MM, 49 per cent MN, 21 per cent NN; so 
Pmm ~ pMN = pNN = ‘21, and the gene frequencies are p = 
•30 + i X *49 = - 545 , y J X -49 -I- -21 = -455. 

Now when a man marries, he is scarcely likely to worry about his 

bride’s blood-group, or she about his. So we can expect the various 

genotypes to mate independently: the chance of a husband and wife 

chosen at random having genotypes MM and NN respectively will be 

PmmPnn- This is known as “random mating” or “panmixia”: it can be 

expected of genes for blood-groups and other characters which produce 

no visible external effect, but not for qualities such as height or intelli¬ 
gence. 
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Now consider a child chosen at random. Its father has a probability 
Pmm of being MM, of being MN, and of being NN. The chance 
that the gene handed down from the father was M is therefore Pmm + 
hpMN = P\ 3nd the chance that it was N is \Pmn + Pnn = The same 
holds for the mother, and because of random mating the two genes will 
be independent. So we have the followng four possibilities: 


1 

Gene from father • 

Prob. 

Gene from mother 

1 

Prob. 1 

1 

Child 1 

Prob. 

M 

P \ 

M i 

P ' 


P^ 

M 

P 1 

N 

i q 


pq 

N 

q 

M 

p 


pq 

N 

q : 

\ N 

1 

q 

1 


1 

1 


Thus the child has a probability P,vi\/(say) — p- of being MM, Pmn — 
pq + pq ~ 2pq of being MN, and P^.w = g' of being AW. This is the 
“Hardy-Weinberg” law. The probabilities Pmx, will be the 

genotype frequencies in the new generation, and the gene frequencies 
will be accordingly P = P^At + ^ p^ ^ pq = p{p + q) = P and 

Q = \Pms “t -P.v.v = Pq q“ ~ q- That is, the gene frequencies will 
not be changed from one generation to the next, and the whole process 
will repeat. 


EXAMPLES 

(1) For Britain the frequencies of M and N are *545 and *455. So 
the genotype frequencies should be (-545)- = -297 KiM, 2 x *545 X 
•455 = *495 MS, and (-455)“ = -207 SS. These agree well with the 
observed percentages. 

(2) Henrietta, of group MM, marries Harold, of unknown blood- 
group. They have a son Ivor: what can we say about Ivor’s group? 


I'he calculations proceed as follows: 


Harold’s 

group 

1 

Prob. 

1 

Ivor’s group 

j ilLV/ 

! MN \ 

MM 

MN 

NS 

» 

P- i 

2/>7 

q- 

1 A" - ' 

2pq . i 

0 

1 

0 

: 2/><7 . i 

, 9" • I 1 

Total 

\ I 

. P 1 

1 

1 

1 \ 

q 


NN 


o 

o 

o 


o 
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We know that Harold has a chance /)- of being MM, zpq of being 
MN, and of being NN. If he is MM, since Henrietta is also MM, 
Ivor has a subsequent chance i (= certainty) of being MM: so the 
chance of Harold being MM and Ivor MM is p- . i = p-, as shown in 
the table. Similarly the chance of Harold being MX and Iv'or MM is 
2.pq . \ = pq. By addition of all the ways in which Ivor can have group 
MM we find this has total probability p- pq = p{p q) = p\ simi¬ 
larly he has probability pq -\- q~ — {p -\- q)q = q of being MX, and 
zero probability of being AW. 

(3) Suppose that in the last example Ivor was tested and found to 
have group MM. What can we then say about Harold.^ 

By the formula for conditional probability, the chance that Harold is 
MM given that Ivor is MM 

~ Pr [Harold MM & Ivor MXr\'Pr [Ivor A/.I/], 

in each case the general form of the pedigree and assumption of random 
mating are given. That is, probability that Harold is AIM = p-'p = p\ 
similarly the probability is q that he is MX. 

If we have more than two allelomorphic genes, a similar formula will 
hold. 'I'hus in the case ol the . 4 -B-O blood-groups, let the frequencies 
of the Ai, A 2, O, and /i genes be p, q, r, and s respectively, where 
P + q 4- r s -- 1. 'rhen the genotype A,At will occur with fre¬ 
quency P“, type A2A2 with frequency q^; while A1A2 can arise in two 

Oene from falher 
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V 
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V 


B 

Fig. X9.3— The blood-group frequencies in the general population 
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ways, according as the gene conies from the father or the mother, 
and so has frequency 2pq. This is illustrated in Fig. 19.3 where the 
vertical strips are of relative widths p, 5, r, and s (= 21 per cent, 7 per 
cent, 66 per cent, and 6 per cent respectively in a British population), 
representing the relative probabilities of an A 2, O, or B gene respec¬ 
tively being handed on by the father. The horizontal strips correspond 
to the same thing for the mother; the areas of the small rectangles 
into which the square is divided represent the probabilities of the result¬ 
ing genotypes of the individual concerned. Now a B gene is dominant, 
and will be detected wherever it occurs. Such cases are shown by the 
horizontal shading. An Ai gene is also dominant; its presence is indi¬ 
cated by the denser oblique shading; while an .^42 gene will be detected 
except when Ai is also present, and is indicated by the less dense 
oblique shading. The frequencies of the serologically distinguishable 
blood-groups in the population will therefore be: 


Blood-group 

Genotypes 

Frequency 


A-^A^y A1A2, Ai 0 

p^ 4 - 2pq + 2pr 

A,B 

A^B 

zps 

A, 

A^Aoy A 2O 

q^ + zqr 

A^B 

AoB 

zqs 

0 

00 


B 

BBy BO 

4 - 2jr 


PROBLEMS 



(1) What frequencies of blood-groups would be expected in a popu¬ 
lation in which= *15, q = -03, r = -75, s = -07? 

(2) Kenneth (group O) marries Kate (of unknown group), and has 
a son Leonard. What is the chance that Leonard is of group B} If 
Leonard’s blood is tested and found to be By what is the chance that 
Kate is A^B"^ 

(3) A dominant gene G has frequency py and the corresponding 
recessive gene g has frequency q = i — p. Maurice is a man picked at 
random and found to be of type G (i.e. GG or Gg). What are the chances 
that (i) his son Norman, (ii) his father Leonard are of type G} If Leo¬ 
nard is found to be ggy what is the chance that his brother Michael is of 
type G? 

Dr. H. Harris studied the inheritance of premature baldness in 
men, occurring before the age of thirty (Ann. Eugen. Lond.y 13 (1946), 
172). He formed the hypothesis that it might be due to a dominant gene 
which shows only in men, i.e. men of type G are prematurely bald, but 
men of type^^ and all women are non-bald. Taking/> = *07 does this 
agree reasonably well with the following observations?— 
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(a) out of 900 men of appropriate age, 120 (= 13 3 per cent) were 
prematurely bald; 

(i) out of 100 prematurely bald men taken at random, 56 had 
prematurely bald fathers; 

(c) out of 41 brothers of prematurely bald men with non-bald 
fathers, 19 were found to be prematurely bald. (Small discrepancies 
between the theoretical and observed proportions may be due to samp¬ 
ling deviations: for an exact treatment of these, see Section 21.4.) 

19.10 Cousin marriage 

It is sometimes said that the children of marriages of first cousins are 
at a disadvantage: they are more likely to suffer from the effects of 
harmful recessive genes than those children whose parents are unrelated. 
We can readily calculate the magnitude of this effect. 

Suppose that there are a pair of genes A and a in a population, with 
frequencies p and q respectively. Let X be an offspring of a first-cousin 
marriage, but otherwise chosen at random; and suppose that apart from 
the relationship between X's parents all mating is at random. Now X 
obtained one gene from her father: and this has chance p of being A, 
and chance ^ of being a. AT also obtains one gene from her mother: but 
this is no longer completely independent of that obtained from her 
father, because there is a possibility that they may originally hav'e been 
the same gene passed down on both sides of the family from a common 
ancestor of the mother and father. The gene X gets from her father can 
have come from any one of her father's four grandparents: and the 
other gene can have come from any one of the mother’s four grand¬ 
parents, making altogether sixteen possible pairs of parent’s grand¬ 
parents from which X can get her two genes. But since the mother and 
father have two grandparents in common, there will be two cases out of 
the sixteen in which X gets both genes from the same greatgrandparent, 
making a probability 2/16 = 1/8 of this occurring. However, even 
when both genes come from the same greatgrandparent G, it does not 
follow that they must be alike. For G has two genes, one derived from 
each parent; and he or she is just as likely to hand down genes derived 
from different parents to the two sides of the pedigree as to hand down 
genes derived from the same parent. 'Fhus finally we sec that X's two 
genes have a probability ^ x J ,\i of being descendants of a single 
common ancestral gene. Since this gene had a probability p of being A 
and q of being a, it follows that in this case X has a chance A/) of being 
AA and Av of being aa. Now in the remaining fifteen cases out of six¬ 
teen, X's genes will be derived from unrelated sources, and the random 
mating formula will hold: the probabilities will be |S/)- for A A, 
^pq for Aa, and \%q^ for aa. In all the chances for X will be 

-A(/> + i5/>^) . A A : 


V/>^ . Aa : -A(y -f i$q^)aa 
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as against : 2pq : q- for a child of unrelated parents. Thus as a 
result of cousin marriage the chance of X suffering from a recessive 
condition aa is increased by + i 59 “) — — Q)' This 

never exceeds which is its value when q ^ and for small q, which 
corresponds to a rare recessive, it will be very small indeed (being 
approximately 1^^). Thus viewed from the child X'q point of view the 
relationship between the parents is of little importance. However it is 
important for the geneticist. For if we consider not the difference but 
the ratio bet^veen the probabilities in the two cases, we see that the 
chance of suffering from aa is increased iV(^ + + 

15/16 times, as compared with the child of unrelated parents. When q 
is small the term 1/169 may become quite large: so that cousin marriages 
give a relatively high probability of suffering from the condition. And 
therefore of the cases actually obser\^ed a higher proportion will have 
first-cousin parents than w^ould be expected on the basis of cousin mar¬ 
riages in the general population. This is the basis of the standard test 
for recessivity of a rare condition. 


19.11 Artificial inbreeding 

For genetical and other purposes it is important to have a stock of 
animals or plants which are as alike as possible. Now this can be 
achieved if all the animals or plants are similar homozygotes, i.e. all 
AA or all aa, for then all the offspring will be similar to their parents. 
AA X AA can give nothing but AA. How'ever, any Aa individuals 
w'ill destroy this homogeneity, so it is usual to subject the stock to 
prolonged inbreeding to make it completely homozygous. 

We shall consider for illustrative purposes a stock of plants being 
subjected to continual self-fertilization. Let us suppose that at first it 
consists of a proportion />o of type A A, a proportion qQ of type Aa, and 
Vq of type aa (so that p^ q „ r^ = i). 

Now' in the next generation AA plants on self-fertilization whll 
give nothing but A A progeny: and aa plants will give nothing but aa. 
But Aa plants will give hAa and }aa. So the proportions pi, 

9j, r, (say) of AA, Aa, and aa plants in the next generation w'ill be 

Pi" Po + 4?0 

9 i = ^9 0 

ri = ko + ^0 


The proportions p^, 92» in the third generation will be connected 
with pi, 9i, Ti by similar equations: w'e wish to know' their values in 
the //th generation. Now' w'c can write these equations in matrix form 
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or (say)/7j' = ap where />□' stands for the column vector [/>(,, q q, r 
Pi for [pi, 9i> ^i]', and a for the matrix of multipliers. Similarly = 
api = a^po', and in general the proportions in the ;/th generation will 
be given by = a^p^ . This can be expressed in more convenient 
form by using equations (18.39) and (18.40) for the «th power of a matrLx 
a. V is here a non-singular 3X3 matrix and L a diagonal matrix such 
that aV = VL, and then a" = The values of V and L have 

been already found in Section 18.14 (P- 533 )> and substituting these in 
the equation />„' = a^p^ ~ VL” P 0 this becomes 

rprr\ ri I 0-1 pi" o 0-1 pi I o-|-^p;,o-| 

Urirl:-: 

When the matrices are multiplied out this gives 

Pn^ pQ iqo — 

= 2-”qo 

— 2-"-' 9 o ■ - (19.6) 

After a few generations 2-" will be very small, and can be effectively 
considered as zero. 'I'he population will then consist of a proportion 
+ Iqoof type AA; and a proportion r„ 2= Tq + type aa. 

1 he heterozygotes Aa will have disappeared. More complicated types 
of inbreeding can be similarly dealt with; see, for example, K. A. Fisher, 
The Theory of Inbreeding, 1949, Oliver and Boyd. 

19.12 Binomial and multinomial chances 

Suppose we toss a coin three times: what are the probabilities of 
obtaining o, i, 2 or 3 heads? 

For brevity, write // for "heads” and T for "tails”. I'hen there are 
eight possible outcomes of three tosses of a coin, all of equal probabilitv* 
TTT, HTT, THT, TTH, HHT, IITII, THII, HJIH. In the first out¬ 
come there are no heads; in each of the next three there is one head, in 
the next three two heads, while in the last case all the tosses come down 
heads. So the probabilities of obtaining o, 1,2, and 3 heads are rcsoec- 
t'vely i, g, g, and 

Now consider a family of three children from an x Aa mating, 
where .d is dominant to a. The children will then be of two distinguish¬ 
able ("phenotypically different”) kinds, which we can denote (for con¬ 
venience) by the letters A and a. Furthermore the probability of a 
child being A is 3, and that of being a is J. Thus we have eight possible 
?"®*^Kements of the phenotypes of the three children arranged in order. 
Ihe arrangement AAA (i.e. all children A) has probability (|)3 == 
p/64. 'I'he arrangements aAA, AaA and AAa all have probability 

ix |x I = These arc the cases in which there are two .d’s: so the 
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chance of getting two A's out of three is ^ ^ = li, hy the 

addition rule. ^ ^ 1 j 

The arrangements in which there is only one A are Aaay aAa and 

aaA', and each of these has chance | X i X i = Thus the chance 
of obtaining only one ^ is & + A + = &■ Finally the arrangement 

aaa with no A*s has chance 

In general suppose that an event or “experiment” B has two possible 
consequences A and a: A (a “success”) has probability p, and a has 
probability Then if we repeat the event or experiment B n times 
(take n “trials”) the chance of obtaining x A*s and y a"s will be 

= Arr (;c,jy)pV . - • (i9*7) 


where Arr {x, y) stands for the number of different arrangements or 
“words” we can form out of x A^s and^' a’s set out in a row. For each 
such arrangement will have probability and since they are mutually 
exclusive events we can add their probabilities to find that of obtaining 
any one such arrangement. 

More generally, if an event E has more than two possible conse¬ 
quences, say A with chance p, B with chance 9, C with chance r, then 
the probability of obtaining x .rtl’s, y B's and sr C’s in n trials or repeti¬ 
tions will be 

P^c.v.z = Arr (.v, y, ^r) p^q^'r^ . • (i 9 - 8 ) 

(x -i-y + z = 7i) 


where Arr (.v, v, ~) stands for the number of different arrangements or 
words we can form from x letters Ay y letters B and s letters C set out 
in a line. For if we write out the consequences of the n successive events 
in order we shall get just such an arrangement. And each separate 
arrangement in which there occurs .v . 4 ’s, y B's, s C's will have proba¬ 
bility p-^q^r^ by the multiplication law of independent probabilities. If 
there are more than three possible consequences A, B, C, D . . occur¬ 
ring respectively .v, y, z, tv, . . . times, the probability evidently becomes 
Arr (.V, y, z,w,..) p^q^r-s*^ . . . where q,r,s, . . . are the probabilities 
of A, B, C, /),... occurring, given E, in a single experiment. 

\Vc shall begin by finding the number of possible arrangements ot 
ft letters or objects . 4 , B,C,D... when each object occurs once and only 
once (i.c. when .v = y = xr - ec - . . . = i). If there is just one letter 
A we have only one possible arrangement or “word” A. If we bring in 
a second letter B, this can be put either before or after the A, giving 
I X 2 -- 2 “words” BA, AB. A third letter C can be added to either 
word BA or AB in any one of three positions; at the beginning, in the 
middle, or at the end. So any two-letter word gives rise to three distinct 
three-letter words; and so the i X 2 possible two-letter w^ords give 
I X 2 X 3 = 6 possible three-letter words, namely CBA, BCA, BAC; 
CAB, ACB, ABC. We can now add a fourth letter D to any of these 
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words in four ways either at the beginning, between the first two letters 
betw-een the second two, or at the end. So there are i x 2 x i x 4 =’ 
24 possible arrangements of four letters. Proceeding in this way we see 
leners *- ^‘‘^'^^Sements or words that can be formed from « 

No\v consider the case in which two letters are identical, say both A 

Cy D, E . . . We can obtain such 
words by taking those m which all the letters are different, and re¬ 
placing Bhy A wherever it occurs. But we must now divide the num¬ 
ber of arrangements by 2. For every' word CBDA ... in which all 
letters are different has a partner CADB ... in which the letters A and 
are interchanged, all the other letters remaining fixed. And these 
two partners will give only one word when B is replaced by A The 
number of distinct words is now j« jz. 

tl, "'6 replace .v distinct letters by a single letter A we must divide 
tne number of words by |v. For each word in which all letters are dis¬ 
tinct will have a number of “partners” obtained by interchanging the .v 
cnosen letters among themselves, and leaving the others fixed. The 
total number of such interchanges will be jx; and they will all give the 

same arrangement when we replace all these letters by A. So the num¬ 
ber ot arrangemerns of „ letters in which .v are A% and the remaining 
(n — x) are all different, is ^ 

Now suppose we go on to replace y of these further letters by B- the 
sarne argument shows that we must now divide the number of words 
y (y and It becomes \nj\x\y. Continuing in this way we finally obtain 
the number of arrangements of ar A'?., y B'sy z C’s, . . . 'I’his is 

Arr (.v.y, a: . . .) = |y , 

Si^stituting back in equation (19.8) we find the probability of obtaining 

W s, y B’s and ;r C’s to be ^ 

P'^ssibilities A and (2 the chance of 
Obtaining x A s and y = (« _ at) a’s is 

px = [« />* V[f j n — X . 

chance of obtaining two dominants in a family of three 
i 3 (miy/l 2 |_i 27/64, as we have already found. 

tion^’^pr/'^” method of “generating func- 

oroKoK-r ^ suppose that E is any event with consequences A, B C with 
pr^ab.htics A and r respectively. We can represent this in th^;;,. 

by a Drnir *^^>1 t^iat E is followed 

evenr^f ^ ^ a^d r of C’s. Now let E be another 

» followed say by consequences H and K with probabilities P and 
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Q respectively. This can be represented by the expression or “gener¬ 
ating function’* (PI/ + QK). We shall further suppose that the conse¬ 
quences of F are independent of those of E. Then on multiplying the 
expressions (pA -h qB -h rC) and (PH + QK) together as if they were 
ordinary algebraic expressions we obtain the product 

(pA -hqB + rC) (PH + QK) ^ pP. AH + qP . BH A-rP. CH 

-\-pQ .AK ^qQ .BK -\-rQ .CK 

and this sets out in convenient form all the possible consequences AHt 
BH, . . . CK of the combined event E and F together with their respec¬ 
tive probabilities. In particular the repetition of the event E is repre¬ 
sented by the product (pA + qB -{- rC) (pA + qB -h tC) => p^AA Ar 
pqAB -b prAC + qpBA r-CC. If however we are not inter¬ 

ested in whether A or B occurs first the two terms pqAB and qpBA can 
be combined to give zpqAB in the usual way, meaning that there is a 
chance zpq of obtaining lA and i 5 , ignoring the order in which they 
occur. Similarly by multiplying out the expression (pA qB ^C)^ 
we shall obtain all possible results of repeating E n times together with 
their respective probabilities. But by the multinomial theorem the term 
containing A^B''C‘ in this expression is 

A^B^C^ = \Ji(pA)-(qB)''(rC)ilxjyl^ 

agreeing with (19.9). 

In particular, if we have only two results A and a with probabilities 
p and q respectively, we can find the probability p, of x A*s in n repeti¬ 
tions by working out (qa -hp^)" according to the Binomial Theorem: 

(qa + pA)^ = PoaM« -h+ . • - + p„a®^ 4 " . . (19*10) 

This means that the right-hand side is the expression obtained on ex¬ 
panding the left-hand side by the ordinary algebraic rules. It will there¬ 
fore still be true if we replace the letter a by the ordinary number i, and 
the letter A by any number t, thus obtaining 

(9 + pt)" = />„ + p,t +p^r- + ... +p,t’' . . (i 9 -ii) 

an identity we shall need later. 

PROBLEMS 

(1) How many different “words” can be formed by rearranging the 
letters of FISH, NORMAL. MINIMUM, ZOOLOGY. MISSIS¬ 
SIPPI? 

(2) What is the chance of getting five dominants out of ten children 
from an Aa x Aa mating? 
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(3) William, of group MN, marries Wendy, and has four children 
of group MN and two of group NN. What can you say about Wendy’s 
blood-group? 

19.13 Approximation to the binomial 

Fig. 19.4 shows, in graphical form, the probabilities />* = |8(i)^ X 
( |8 — X of obtaining a: = o, i, 2, 3, . . . 8 heads in eight tosses of a 



Fig. 19.4—The binomial distribution (iT + JH)** with superimposed 

normal curve (broken line) 

coin. These probabilities will be obtained by expanding the binomial 
expression + 4 //)**, where 7 ’stands for “tails” and H for “heads”, 
and interpreting the term — |8(i)'*-"'( J)^/|8 —x j.v . 7 ’‘*-^ 7 /' 

as meaning that the probability of obtaining (8 — .v) tails and .v heads is 
px. These probabilities are represented by the rectangular areas above 
the axis in the upper half of the diagram: it will be seen that the greatest 
probability is that of 4// and and that the chances fall off on either 
side. 

In the upper part of Fig. 19.5 we show in a similar way the proba¬ 
bilities for o, 1, 2 . . 8 dominants A out of eight children from a mating 
Aa X Aa \ these probabilities are obtained by expanding the expression 
(ifl -f- -yp^a^A'- + , . . +/>and interpreting the 

00 
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Fig. 19.5—The binomial distribution (]a + JA)* with superimposed 

normal curve (Iroken line) 


term as meaning that there is a chance px of obtaining (8 — .v) 

recessives a and .v dominants A. The probabilities no longer form a 
symmetrical figure, but they still have a peak from which they slope 
down on either side. Now the form of these diagrams suggests that the 
value oipx should be approximated to by a smooth continuous curve; 
we have plotted such a curve on each diagram for comparison. Such an 
approximation can be found as follows. 

In the general case />,, the probability of obtaining x A*s and («—a:) 
fl’s is given exactly by 

px = jA’ j« — .V . . (19.12) 

where p is the probability of a single event A and g = i — p the proba¬ 
bility of a. From this we deduce that 


so that 


== px • p{n - x)!g{x + i) 

Px + x ~P x ^ [/)(« - -v) — q (x + 1)] p^ 

+ PPx [?/>(« - .v)'+ pq{x + i)] Px 

= { 7 ip -H 9 — a)/(h -f- i)pq 


(19-13) 
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Now consider the meaning of the left-hand side of this equation. 
The difference — Px represents the increase in when x increases 
from a: to (jc -j- i), i.e. the average rate of increase of between these 
units. If the graph of p^ can be approximated to by a smooth cur\’e 
y = this difference will be approximately the actual rate of change 
Dx 4 >{.x) on this curve. The denominator is 

qpx+i -\-ppx = qpx^i + (i — q)px = px ^ q{px+x — px) 

and since q lies between o and i this lies between the two neighbour¬ 
ing values Paj and px + i- Now suppose that n is very large: then it is 
reasonable to suppose that a change of i in the number x will not greatly 
alter the probability of getting x successes out of n. Thus and />* + i 

will be nearly equal and the expression qPx-y\ + PPx which lies between 
them will be approximately />„ or Thus the left-hand side of 

equation (19.9) can be written approximately as Dxi>{x) ■ <l>{x), where 
is a hypothetical function of x, represented by a smooth curv'e, which 
we are searching for and with which we hope to approximate to />,. 

Since n is large we can neglect q in comparison with np on the right- 
hand side—at least to a first approximation—and i in comparison with 
and write it as {np — x)}npq. So equation (19.9) becomes 

Dx 4 >{x)j<i>{x) = {np — x)lnpq 

But the left-hand side is simply In ^(a:). So on integrating this 
equation with respect to .v we obtain 

In = f {np — x)lnpq dx 

= {npx — ix^)lnpq + C 
= ( — f «*/)- -\-npx—^x‘^)lnpq -t-(C + hi'-p'~) 

— ~{x — npy-jznpq + (C hn-p~) 

or, on taking exponentials, 

^(a:) = g-{x-npyi‘inpq 

Now is simply a constant: but since this method does not tell 

us the value of C we do not yet know what it is. However we shall 
show later that it must be approximately {znnpq)-^, so that finally we 
have 

4 >{x) = {zTTiipq)-^ . (i9 -t+) 

as a rough formula for finding the chance of obtaining .v “successes” out 
of n repetitions of an experiment, the probability of a success in each 
experiment being p and the chance of a failure being q = i — p. 
Actually the approximation is quite good, even for small values of w, 
provided that np is not small. 'I’hus the values of ^(-v) from (19.14) are 
plotted in Figs. 19.4 and 19.5 as the bell-shaped curv'es shown in the 
upper parts of the figures; these curves very nearly go through the 
mid-points of the upper edges of the rectangles in the diagram. 
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19.14 Approximation to a factorial 

|H is by definition the product 1X2X3X... X n. But when n 
is large a direct calculation from this definition is very tedious. There 
are tables giving the values of |« and their logarithms (e.g. see R. A. 

Fisher and F. Yates, Statistical Tables for Biological, Agricultural and 
Medical Research, Oliver & Boyd, 3rd edition, 1948), but it is useful 
to have a general formula of approximation. 

In equation (15.5) put K= —i. We obtain 

Je~^ dx = — |H . e—^(i 4-.v/|i+.vV[2+. . .-!-.v”/|/r) . .(19.15) 

Now take this as a definite integral from o to 00. We have 
— ^-co _g-o — o — I = — I while = 0 for r > o, since 

= o and o’" = o. So (19.15) gives “Euler’s integral”, 


/: 


dx — 17 / 


(19.16) 


But .v" = W'e shall now change the variable from x to X 

X — 7/; then (19.16) becomes (since DxX — Dx(X + 7/) = i) 

\fl = f g-X-n gn In (A' + n) 

I - y n 

—> In n [In(A'4«)— In n] ^ Y 

y 

by a slight rearrangement. W’e can write this as 


7/ = /*" 

— y -/i 


dX 


. (19.17) 


where u = —A' + n [ln(A' //) — In ;/]. 

Now consider the function //. When A’ tends to — n, u tends to —00, 
because of the logarithm In (A' -r- 7/). When AT tends to 00 u tends to 
— CO because the term —A far outweighs the term 7/ In (X + 7/). Thus 
u tends to become large and negative at both ends of the range: it must 
have a maximum somewhere between. We can readily find this maxi¬ 
mum by solving the equation DxU = o, i.e. — i + 7/ •(A’’ + 7/) — o. This 
gives the solution A* -- o. and the corresponding value ii = 0. Thus u is 
always negative except for its maximum value o when v = o. Now 
when u is large and negative, e" will be very small indeed, and will con¬ 
tribute very little to the integral of (19.17). (This will be true in spite 
of the fact that the integral extends as far as 00 : for as X increases u will 
decrease nearly as rapidly as —A', and therefore e" will behave very like 
e~^'. and the integral of e—'’ from any large value of A' onwards is 
negligible.) So the only values of u which need concern us are those 
for which u is not large in magnitude, and they will occur when X is 
near o. But when A’ is not far from o we have 
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Now consider the meaning of the left-hand side of this equation. 
The difference ^*+1 —represents the increase in/>^ when x increases 
from a: to (jc -|- i), i.e. the average rate of increase of between these 
units. If the graph of can be approximated to by a smooth curve 
y = 4 >{x)y this difference will be approximately the actual rate of change 
Z)a.^(x) on this curve. The denominator is 

! +PPx = qpx^i + (i — q)px ^ Px + q{px+x —Px) 

and since q lies between o and i this lies between the two neighbour¬ 
ing values and Now suppose that n is verj'^ large: then it is 

reasonable to suppose that a change of i in the number x will not greatly 
alter the probability of getting x successes out of n. Thus />* and 
will be nearly equal and the expression qPx + i + PPx which lies between 
them will be appro.ximately />*, or Thus the left-hand side of 

equation (19.9) can be written approximately as Dx<i>{x)l 4 >{x)y where ^(.r) 
is a hypothetical function of x, represented by a smooth curve, which 
we are searching for and with which we hope to approximate to />*. 

Since n is large we can neglect q in comparison with tip on the right- 
hand side—at least to a first approximation—and i in comparison with 
tiy and write it as {tip — x)lnpq. So equation (19.9) becomes 

D^ 4 >{x)f<i>{x) = {tip — x)ltipq 

But the left-hand side is simply Z)* In ^(a:). So on integrating this 
equation with respect to x we obtain 

In ^(at) — J{tip — x)ltipq dx 

= {npx — \x^)ltipq + C 
= ( — ■^tipx—^x^)ltipq -i-(C -h 

= —(a: — npYjztipq + (C + hi-p~) 
or, on taking exponentials, 

^{x) — g-ix~np)^l2np<i 

Now is simply a constant: but since this method does not tell 

us the value of C we do not yet know what it is. However we shall 
show later that it must be approximately { 27 Ttipq)-^y so that finally we 
have 

^(:e) = . (19-14) 

as a rough formula for finding the chance of obtaining .v “successes” out 
of n repetitions of an experiment, the probability of a success in each 
experiment being p and the chance of a failure being q = i — />. 
Actually the approximation is quite good, even for small values of ti, 
provided that tip is not small. Thus the values of ^(.v) from (19.14) are 
plotted in Figs. 19.4 and 19.5 as the bell-shaped curves shown in the 
upper parts of the figures; these curves very nearly go through the 
mid-points of the upper edges of the rectangles in the diagram. 
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log ^ log {x^e~^) + ^log X + *3990899 -j- '080929 sin (25*623^;“^)° 
(see Biometrika 6 (1908), 118). 


19.15 The generalized factorial 

We have so far defined |rt only for integral values of n. But the 

/•OO 

integral J ax certainly exists for all positive values of n (and 

in fact for all complex values of n for which the real part of n is 
greater than —i). Therefore we may take (19.16) as defining a function 
yi for values of n which are not integers, and study the properties of 

this function. We see at once, from the arguments used above, that 

(i) |« according to the new definition agrees with that according to 
the old, i.e. 1X2X3X....XW, when « is a positive integer. 

(ii) When n is large, |« is approximated to by Stirling’s formula 
(19.20). 

Also we have by direct differentiation 

DJ^e-^x’^) = — e~^x^ 

Integrate both sides of this expression from o to 00, taking n > o. Then 

= J 7ie~^x”~^ dx — J dx 

= n [« — I — |n. 


But when « > 0 the left-hand side is zero, since = o and o" = o. 
So 

n\ n — I . . . (19.21) 


for all positive values of n. (We can use this equation to define yi for 
negative values of n.) ~ 

Sometimes the general factorial j« is called r(n + i), and known as 

the “gamma function”. This notation was introduced by Euler: but 
it is not clear why he added in the extra 1, which only complicates the 
most important formulas: and several modern authors use the notation 
|w or n\ 

We can find one important property of the generalized factorial 
using double integrals (see Sections 16.12, 16.13). have 


QQ 00 

jm j// — f ^ e-'v"* dx J dy 

— f f e~^e~^x*^y^ d\ dx\ 
■J 0 •> 0 ~ 


I'his integration is to be performed over all values of x and y in the 
quadrant lying above the .r-axis and to the right of the y-axis. Now 
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In (X + n) — In w = In [{X + «)/«] 

= In (i + Xjn) 

= Xln - . 

by the Taylor series for In (i + a:) (Section 13.7): so 

u = —X + 71 [In(A' + ft) — In tt] 

== ~X^!2n + X^jyi^ — + • • . (19-18) 


This Taylor series will be valid provided that \X\ is less than n. 
Suppose [ATI is as large as then the first term will be of magnitude 
ti^l^jzn = n^i^/z and if n is large this will also be large; the second term 
will be = 1/3, which will be small in comparison with the first: 

the third will be still smaller, and so on. Thus when « is large X has 
only to deviate from o by as much as «2/3 become large and 

negative; and as we have said we are not interested in the values of u 
beyond this point. Furthermore, since the ratio second term/first term 
is {X^jyi^)l{—X'^l2ti) = —gAT/n, so long as \X\ < h-/® this is a small 
quantity (smaller in magnitude than and so the second term can 

be neglected in comparison with the first. The third term will be still 
less important, and we shall be justified in keeping only the first term 

in (19.14) and writing u = —X^jzti: so /** dX — /* f^x. 

To evaluate this integral we shall change the variable from X to y = 
X( ■\/ 7 t, SO that X = y\^n. 'Fhen this integral becomes 



{dXjdy) dy = y/ti 


r dy. 

y - V ^ 


But when w is large, \/w is large, and the integral from —\//; to 00 of 
the expression is nearly equal to the integral from —00 to 00. 

(When y is large and negative, e-*"' is a very small quantity indeed, and 
the difference between its integrals starting from —00 and — is 
negligible.) We have not yet evaluated the integral from —00 to 00 of 
e-\v^ with respect to y. but we shall show presently that it is \'(2rr). 

So finally we sec that j ^ e'‘ dX zl y/u . y/zn, and so from (19.17) 


^ tn n-n ^^ 27 J-«) . . (19.I9) 

or In = (w -j- \) In « — m + ^ In Zn 

or log |« = {n 4- ^) log n — ti\og e h log z-n . (19.20) 

in terms of common logarithms. This is “Stirling’s approximation to 
and although it has been obtained on the assumption that n is 

large, it proves to be surprisingly accurate for even only moderately large 
values of «. Thus for [Ht gives *922 instead of i, and for [5, 118 instead 

of 120. K. Pe.arson showed empirically that a very accurate formula was 
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In the particular case n = —^ this gives, using (19.25), 


—^ f 


.-iv- 


^dy. 


So f dy = \^(l 7 r). Similarly /"^ dy = \/(^ 7 t), since 

•'0 v ^ 00 

has a bell-shaped graph which is symmetrical about the origin: 
g-i(-vr = e~^^\ So finally 


J dy = p dy + J dy 

^ ZyXhn) = ^{ztt) . . . (19.27) 


This was the result we had to assume to derive Stirling’s formula. 

Another property of the factorial function which we shall not prove 
here is 

\n j—w = tiTT cosec mr . . (19.28) 


provided that n is not an integer. Notice that if we write .v 7 |« as (x)n 
we have the following curious symmetry: “ 

00 

(i) Sum over n: ^ e“" (.v)„ = i 

n « o 


Integral over x: ^“"(.v)^ 


= I. 


The first is the exponential series in disguised form, the second is 
Euler’s integral. 

N 

(2) Sum over ^{a - .v).v_„ (.v)„ = (a).v 


n » o 


Integral over .v; f“ {a - ,v).v_„ (.v)„ <iv = 

The first formula is the binomial series and the second the beta 
integral. The extra i in the second lormula probably comes in from the 
‘W.v” in the integral: we can rewrite this formula as 

•ora 

4v)n -= («).V 

J i) 

and make it a perfect analogy. 

We also have from (19.28) 

(.v)n (.v)_„ sin nirhiTT 
provided that n is not an integer. 
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20.1 Natural variation 

History may or may not repeat itself, but nature never does. No 
two animals or plants are ever exactly alike, not even identical twins. 
Neither does man repeat himself: two measurements of the same 
quantity will rarely give exactly the same answer. If we take the heights 

of all of the population of a town we shall obtain a certain distribution of 
heights. 

Distributions are of two main types, “continuous” and “discontin¬ 
uous”. A discontinuous distribution arises by counting; for example, 
among a large number of litters of mice we shall have some containing 
one mouse with a certain abnormality, some containing two, and so on: 
but no intermediate grades of one-and-a-half mice! In the same way 
there will be a distribution of family sizes in a human population. But 
if we measure heights then every height appears to be possible, at least 
within a wide range: the distribution is continuous. The two types of 
distribution have very similar properties, but the discontinuous ones are 
somewhat easier to deal with mathematically. 

We also meet here with the problem of sampling. A number of 
individuals extracted from a population constitute a “sample”, and it is 
convenient to try to infer the properties of the population from those 
of the sample: indeed this is often the only practicable procedure. But 
to do this we must make sure that the sample is representative of the 
whole population. The simplest way to obtain such a properly repre¬ 
sentative sample is to choose a set of members of the population at 
random. Or at least that is the simplest method in theory, though there 
may be practical difficulties. Accordingly, in what follows we shall im¬ 
pose two conditions on our sample. The first is that it is randomly 
selected; the second that it is only a small fraction of the population, i.e. 
that the population is so large it may be considered as effectively infinite. 
If not certain corrections must be applied to some of our formulas—see 

f^ example F. Yates, Sampling Methods for Censuses and Surveys 
(Griffin, 2nd edn., 1953). 

20.2 One>variab]e discontinuous distributions 

Consider the operation of tossing three coins simultaneously. If 
this operation is repeated n times in all there will be a certain number or 
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“frequency” of occasions on which there are no heads, a certain 
numberof occasions where i head appears,/a with two heads, and/3 
with three. For example in 100 throws we might find/(, = 10, /i = 34, 
/2 = 45 » fz = By definition /„ +/i 4-/2 +/3 = n = 100. This 
gives us a distribution of the discontinuous variable x = the number of 
heads in a throw of three coins. (It is discontinuous because it can only 
take the values o, i, 2, and 3, and no intermediate value such as 
unless that was assigned to a coin which happened to stand on edge!) 

The proportions /q/w = -lo,//w = *34 ,/j/m = *45 and/g/n = *11 
are the “relative frequencies” of ^e occurrences of the values o, i, 2 and 
3 of X in such a sample. 

There are Uvo convenient ways of representing such a sample dia- 
grammatically. The first is the “histogram” (Fig. 20.1) (Greek histds, 
a mast or sail) by which the frequencies are represented by rectangular 



Fig. 20.x—A sample from a population, represented by a histogram 

and a cumulative frequency curve 

areas placed above the x-axis, each area being proportional to the corres¬ 
ponding frequency. By a suitable regraduation of the vertical axis this 
can equally be arranged to show relative frequencies instead of absolute 
ones. 

The second method is to use the “cumulative distribution” of all 
frequencies of x up to and including a given value. There are Fq =/o 
= 10 cases in which .v ^ o, =/(,+/, = 44 cases in which .v — o 
or I, F2=/o H-/i 4-/2 = 89 cases in which .v — o, i, or 2; and 
■^ 3 =/o +/i 4-/2 4-/3— 100 cases in which .v = o, i, 2, or 3; 
clearly F^ — «, the number in the sample. In general the cumulative 
frequency F^ is defined as the total number of cases in which the variable 
does not exceed .v in value. The graph of F* plotted against x will con¬ 
sist of a number of horizontal segments shown in the lower part of 
Fig. 20.1. (In this graph x has been allowed to take fractional as well as 
integral values: thus F.^ means “the number of cases in which x < *9” 
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and is therefore the number of times that .v = o, i.e. 10. This allowance 
of fractional values of jc in the definition of is purely a matter of con¬ 
vention, but it will help us later to relate the theorv to that oi continuous 
distributions. The segments of the graph are joined by dotted vertical 
lines as a guide to the eye.) 

By regraduating the vertical axis we may equally well consider this 
as a graph of the relative or proportionate frequency FJn. 

Now when the sample becomes large the observ ed or sample relative 
frequencies fjn can be expected to approach the probabilities of 
obtaining the various values of x. These are known as the “true” or 
“theoretical” relative frequencies. (These are not ver>- good words to 
express the distinction, but it is difficult to find better ones.) These can 
also be represented by a histogram, or by means of a cumulative cur\ e 
giving the total probability of obtaining a value not exceeding .v: 

^0 ~ P oi P ^ — P 0 P\ poy etc. 7 ’hus Fig. 19.4 

gives the histogram of the true distribution of the number of heads ob¬ 
tained in throwing eight coins, and below it the cumulative probability. 
In Fig. 19.5 we have the histogram for the distribution of the number of 
recessives a obtained in a family of eight from an Aa x Aa mating. 
These probabilities0. />i. • specify the true distribution completely, 

just as the observed frequencies /0, /,. /z . • . specify the sample. 


20.3 The mean value of a distribution 

Now in general the specification of all the separate frequencies /o, 
/i, etc., is a very cumbersome way of describing a distribution—though 
it is the only complete description. But in most cases we rather require 
a small number of quantities which enable us to visualize the main 
properties of the distribution, though not its exact form. One of these 
is the “mean” or average value, obtained by adding the observed values, 
and dividing by the sample number u. 'I'hus if the sample contains five 
observations, .v, = i, = 3, .Y3 = 3, .v, - 2 and .V5 = 4, the mean is 

+ 3 3 -f- 2 + 4)/5 = 2-6; and in general if the n observations are 

^1. •V2, . . . .v„ respectively, the mean is x = (a-, + .Vg + . . . 4 .v„) V; = 
n~^Pxa. However, when the observations are grouped with given 
frequencies (/„ observations for which v =--- o, and so on) there is a 
shorter method of calculation. In the example of the last section there 
were ten occasions on which .v look the value o, and 34, 45 and 11 on 
which X took the values i, 2, and 3 respectively, 
all the at’s is 10 X o q- 34 x i q- 45 x 2 q- 11 
by « = 100 we find the mean of x to be 1-57. 
vertical line crossing the histogram. 

In general x takes the value o in /<, cases, 


'Therefore the total of 

^ 3 = » 57 - Dividir)g 
'This is shown by the 


and the total of all the .v’s is Tj. 
mean is therefore 


I in /, cases, and so on, 
o/o H- i/i I 2/2 q- ... = 2\/,. 'The 


.V = Pxfjn 


(20.1) 
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summed over all possible values of the variable x. (Strictly speaking, 
according to our convention we should use a Greek letter, Z a/a, in this 
sum: but it seems clearer in this instance to use the letter x.) 

Looking at the matter in another way we can say that the mean x 
is the .v-co-ordinate of the centre of gravity of the histogram. 

Now when the sample is very large,/p/w will be very nearly equal to 
po, the probability that x = o\ fjfi will be nearly equal topi; and the 
formula (20.1) for the mean will approximate to Z xp^ = opp -h ipj + 
2p2 + . . . This is called the “true mean*’ or “expected value” of x, 
and denoted by the symbol tx. So 

tx = Zxpg . , . (20.2) 

Thus in the distribution of the number of heads obtained by tossing 
three pennies, pp = J, pj = p2= h and pg = and the true mean 

number of heads is opp + ipj + 2p2 + 3P3 = I + ^ f f = 1-5. 
In our hypothetical sample we found a mean x = i -57, not very different 
from this. 

20.4 Variance 

Another important property of a distribution is its spread or dis¬ 
persion. Can we expect the obser\'ed values to be distributed over a 
wide range, or will they be clustered together.? For example, suppose 
we measure the weights of a heterogeneous collection of adult mice, and 
also of a pure inbred strain. In each case the weights will have a certain 
distribution, but we shall expect those of the inbred strain to differ only 
slightly from one another, since they will be practically identical geneti¬ 
cally, while those of the heterogeneous group will have much greater 
variability. Ihe average weight for the inbred strain may also be 
noticeably different from that of the mixed group: but this will not 
necessarily be the case. 

There are several possible ways of measuring this spread or varia¬ 
bility. If we take a sample and find its range, i.e. the difference between 
the greatest and least values in the sample, we have one such measure. 
But this is as a rule not very suitable, since the larger the sample the 
more exceptional values it is likely to contain, and therefore the range 
will probably increase as we take larger and larger samples. For example 
a small sample of a British adult population would probably consist of 
persons between 1-5 and 2 metres in height: but by extending the sam¬ 
ple sufficiently we could find a few dwarfs of considerably smaller 
height, as well as a few giants exceeding 2 metres. No doubt there are 
certain upper and lower limits to the height of a human being beyond 
which life becomes impossible: but these limits are not known with any 
accuracy. 

Another measure is the average deviation from the mean, or “mean 
deviation”. Consider the sample of the preceding section, consisting of 
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five observations i, 3, 3, 2, 4, with mean 2-6. The deviations from the 
mean are respectively i — 2 6 = —1*6, 3 — 2-6 = 4, 3 — 2*6 = *4, 
2 — 2-6 — —6 and 4 — 2-6 =14 respectively. These cannot be use¬ 
fully averaged as they stand, for their average is } ( —1-6 -f- *4 + 4 
—6 + 1-4) = o; the positive and negative deviations cancel out. But 
if we make all the signs positive, obtaining i -6, 4, 4, -6, and 1*4, respec¬ 
tively, we shall find a mean deviation + -4 + 4 -i- -6 + i'4) = 

5(4*4) = *S8* In general the mean deviation will be defined as the 
average value of \x — x\, where x is the sample mean. If the w obser¬ 
vations in the sample are separately specified, say as a:,, aTj, . . . then 
X = £Xa, and the mean deviation is w"* £\xa — 5 ]. If the obser¬ 

vations are grouped together, so that there are/o cases in which .v = o, 
fi cases in which a: = i, and so on, then the mean deviation will be 
— x\ln. 

This measure of spread has the advantage that it will not be greatly 
affected by a few exceptional values in a large sample, provided that 
these exceptions do not deviate excessively far from the main part of the 
distribution. But there are certain objections to it on theoretical grounds. 
One of these is that the rather crude device of changing all the signs to 
plus makes it difficult to handle mathematically. So in practice it is 
never used. Instead the usual measure is the “standard deviation” a. 
This is based on a very similar line of thought, but uses the fact that the 
square of the deviation is always positive. Before introducing the 
formal definition, however, it may be useful to go through some pre¬ 
liminary discussion to make clear the underlying ideas. 

At first sight it seems natural to consider using the average squared 
deviation as a measure of spread. Thus in our example the deviations 
from the mean arc —1 -6, -4, 4, —-6, 14; the squares of these numbers 
are 2-56, 16, 16, -36, and 196 respectively, and the mean squared 
deviation is therefore 1(2*56 -f- *16 4- *16 -|- *36 + 1*96) = 104. We 
can find a mechanical analogy to this; if we imagine the five observations 
to be represented by five masses, each equal to 1 in suitable units, placed 
at the points i, 3, 3, 2, 4 respectively, then the mean, x — 2*6, corres¬ 
ponds to the centre of gravity of the system, and the mean squared 
deviation to the moment of inertia about the centre of gravity. 

However, it is better to modify this definition slightly. In most 
cases what we would like to calculate would be the mean squared 
deviation from the true mean tx: that would be r(A*a — 

where the observations arex,, X2, . . . x„. But as a rule we do not know /x, 
but have to use the sample mean x instead, and for small n this makes an 
appreciable difference. It can be shown that it decreases the mean 
square on the average in the ratio (n — i)/;/ and will be corrected for by 
dividing the sum of squares of deviations by (n — 1) instead of u. 
Accordingly we shall call 


V == £(xa — xyi(n — i) 


(20.3) 
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the “sample variance”, and consider it as a measure of spread or dis¬ 
persion. In our example this becomes 

V = (2*56 + 16 + -16 + 36 + I'96)/4 = 1-30 

Note I. —This use of the divisor (w — i) instead of n is not universal: 
the reader will find books which use n. However the divisor (« — i) 
seems the best. It is the standard divisor used in the calculation called 
the “analysis of variance”, and known as the “degrees of freedom” or 
“d.f.”. Furthermore, if we consider an extreme case of a sample con¬ 
sisting of only a single observation a-j, then a, — x, and formula (20.3) 
gives an indeterminate variance v — 0/0. That is reasonable, since a 
single observ'ation gives no information about the spread of a distri¬ 
bution. If we took the divisor ?/ = i we would always find = 0, 
whereas further observations would be likely to show a non-zero spread. 


If the obser\'ations are grouped, so that there are/^ of value o,/i of 
value I, and so on, then the sum of squared deviations from the mean 
will be 


■S =/o(o - xf +/,(! - xf +f,(2 - if + ... = Zf^^x - Xf 

‘ (20.4) 

For each observation x = o will have a squared deviation (a — xY = 
(o — x)‘y and there are/p such observations; similarly each of the/i 
observations for which .v = i has a squared deviation (i — a)-, and so 
on. Thus in our hypothetical example in which /□ = 10, /j = 34, 
/2 = 45 /s = II, we have .v — 1*57 and 

S = 10(0 - 1-57)2 -f 34(1 - 1-57)2 + 45(2 — 1-57)2 -f 11(3 - 1-57)2 
= 66-5100. 


We now have by definition v — S/(w — i) = 66-51/99 = -6718. 

Considered as a measure of spread there is still one disadvantage to 
the use of the variance. It is the average of the square of the deviation 
from the mean—apart from a correcting factor ?«/(« — i), which is nearly 
equal to i when n is large. But it is natural to require a measure of dis¬ 
persion to be in some way comparable to a deviation itself, rather than 
the square of such a deviation. For that reason it is usual to calculate 
the square root s of the variance; this is known as the standard devi¬ 
ation. It is a sort of average deviation, slightly greater as a rule than 
the “mean deviation” defined above, but more tractable from a mathe¬ 
matical point ot view. In many cases it is approximately true that the 
standard deviation = ^ X the mean deviation; and also that in a large 
sample only about one obse^^’ation in three deviates frqm the mean by 
more than the standard deviation. These rules are (nearly) true for the 
“normal” or “Gaussian” distribution, which we shall study later; and 
many distributions which occur in practice are approximately Gaussian. 
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In the first of our two examples, in which the obsen-ed values of .v 
were i, 3, 3, 2, and 4 respectively, the variance was calculated to be 130: 
the standard deviation s is therefore 'v/i-3o= 114. In the second 
example, where = 10, = 34, = 45, /a = 11 we found v — 
•6718, and so s = \/v = -Sao. 

Now just as the sample mean x is usually regarded as an approxi¬ 
mation to the “true” mean /x = Cv of the distribution, so the v’ariance v 
can also be regarded as an estimate of the “true” variance v. By this 
we mean that when the sample number ;/ is large the variance v calcu¬ 
lated from the sample will differ inappreciably from v. Now we know 
that (by definition and equation 20.4) 


V = Sl{n — i) = — i) 



(20.5) 


where the summation is over all possible values of x. But when ri 
becomes large the fraction ^^/(/^ — 1) tends to i, the fraction f^ln tends 
to the probability p^, and x tends to the true mean /x = Cx. I'hus v will 
approximate to 

V = ^ Px{x — . . . ( 20 . 6 ) 

The square root of u, called ct, will be the “true'" standard deviation of 
the distribution, i.e. the standard deviation of a very large sample. 

Thus for the distribution of the number of heads obtained by tossing 
three pennies wc have />„ = i, />, = g, /.j, = g, j and ^ t.v - 

From (20.6) we therefore find the variance 


y = />o(o “/x)2 -h/),(l - 

= + 
= g = *750 

and standard deviation 


+ p 2(2 ~ fxy 4 - 7)3(3 _ 

my + m^- 


a = V'750 = 866 


as compared with the sample values v = *672, s = *820. 

.. ^ the sample s.d.. and a for the 

true or distribution s.d. follows a convention, proposed l)v K. A. 

Fisher, that Latin letters should be used for values obtained from a 
sample (of moderate size), and Greek letters for the corresponding 
true values obtained from a very large sample. Unfortumuelv this 
convention IS not always adhered to, and in particular the letter o is 
often used for the sample standard deviation. F. Yates, in his book on 
!>ampltng methods for censuses and surveys, has proposed an alternative 
convention, that “true ” values should be printed in heavy type -e.g. 
o for the true s.d. 1 his again has much to recommend it, hut it remains 
to be seen whether it will be generally adopted among statisticians. 
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Note 3.—If we suppose that for each integral value of jc a mass />*, 
equal to the true probability of attaining the value a:, is placed on a 
uniform scale at the point .r, then 

the total mass is i; 

the mean m = Sxpx corresponds exactly to the centre of mass; 
the variance v = ^px(x — fj.y corresponds exactly to the mom¬ 
ent of inertia about a perpendicular axis through the centre of mass; 

the standard deviation a — ■\Jv is equal to the radius of 
gyration. 


Note 4.—In the coin-tossing considered above, the variable x = the 
number of heads can only take a finite number of values, viz. o, 1,2, or 
3. It is possible to conceive of cases in which the value of x is not 
bounded in this way. For example we might take the obser\'ation x to 
be the number of throws made on a single coin before it falls heads. 
Here there is no bound we can place for certain on the value of x\ it is 
possible, though fantastically improbable, that the coin would fall tails 
for millions of successive throws. Now in such a case the sums 

fi = pxX and u = px{x — /i)- 

will become infinite series, and there is a possibility that these series 
may sometimes not converge. In such a case we could not speak of the 
distribution having a true mean or variance at all. However this compli¬ 
cation is fortunately rare in practice, and will be ignored in this book. 


20.5 Simplified scheme of computation 

1 he equations (20.3) and (20.5) which define the sample variance 
V can be put in another lorm which is more convenient for computation. 
They are both ot the (orin v ~ S '{ji — i), where S is the sum of squared 
deviations from the mean, i.e. S = T(.Ya — .v)= in the case where the 
observations are set out separately, or 5 — T/^(a — x)- when they are 
grouped. Now 




— 2.V .Va + X-) 

w XiX “P Ar 


where there is one term in each summation for each obsenMtion Xr. 
Now T .Ya is by definition the total or sum of all the observed values; 
and this can also be written «.v, since the mean x is by definition TJn. 
The last term S x- is the sum of n equal terms .v®, one for each obser¬ 
vation .Yr, and so is equal to w.v-. Substituting in these values we obtain 

•5 = T .Va“ — 2 :y . nx -|- nx^ 

— Z .Ya" — W.V- 

= T .y„2 - 
- T.Ya^ - TxMn 


(20.7) 
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Now in this equation the expression Z x^~ means the sum of squares of 
all the observed values, e.g. i 2 + 32 32 ^ 2^ + 4^ = 39. This is 

often called the crude sum of squares’* T The term to be subtracted 
from this can be written as nx~, or as or as r//«; this is the “correc¬ 
tion for the mean”. In our example the correction is T'^.v = 13 x 2 6 
= 33*8. The sum of squares of deviations, or “corrected sum of 
squares” 5 is therefore T. — T^x = 39 - 33 8 = 5 2. Finally we 
divide by (« — i) = 4, the "degrees of freedom ”, obtaining a sample 
variance v = Sj{n — i) = 5-2/4 = 1-30, as before. 

1 here are two points to be noted here. The first is that the “correc¬ 
tion for the mean” can be conveniently written either as i.e. “total 
times mean , or as 7 “square of total divided by sample number”. 
Which one to use is largely a matter of personal preference. But when 
using the form “total times mean” it is always advisable to calculate the 
mean to two or three more figures than will be required in the final 
answer, for very often several figures are lost in the subtraction. 

The second point is that the quantity S or “corrected sum of 
squares” is often needed for its own sake, and deserves a distinctive 
name of its own. T he name “deviance” has been suggested, and is 
perhaps the most suitable. 

When the observations are grouped, the formula for the “crude sum 
of squares” T2 becomes x%. The “correction for the mean” is still 
— T^x (or and the deviance S is accordingly T. — T^x, as 

before. It is convenient to set the calculations out in a* systematic 
manner, as follows (Table 20.1): 


Table 20.1—Calculation of the mean and variance 


X 

fx 

\ Xfx 

1 

! x^fx 

% 

!i *■+■ 

(■'■ + I )/x 

1 

1 (x+ty/x 

0 

10 

0 

1 ® 

11 

'' I 

10 

4 

10 

I 

34 

34 

34 , 

2 

68 

13^ 

2 

45 

90 

180 1 

3 1 

135 

405 

3 , 

11 

33 

99 

1 4 1 

44 , 

I *7^i 

Total 

100 

n 

157 

313 

T2 

1 

/ 

257 

1 

727 


X - TJn I 5700 

s - - r^- ^ 313 - (157) (I-57) = 66- 

V = Sl{n — i) = 66-51/99 — -6718 
S = \/v = -820. 



PF 


Checks-. T(x + 1)/^ ~ n, i.e. 257 = 15-7 -p 100 

+ 2 H- «; i.e. 727 = 313 -f 2 X 157 + 100. 


T I )% 
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The first column sets out the possible values of x, and the second the 
observed frequencies with which these values occur. The numbers 
xf^ in the third column are obtained by multiplying the corresponding 
numbers in the first and second columns; and those in the fourth col¬ 
umn by a further multiplication by the number x in the first column. 
Now the total of the second column will be = n, the number of 
observations; the total of the third column will be = T*, the sum 
of the observations, or total, and the total of the fourth column will be 
Tz — the “ crude sum of squares”. Given these we calculate 

the mean x = TJn, deviance S= T^ — T^y variance v = 5 /(w — i), 
and standard deviation s — ^/v in the usual way. The last three col¬ 
umns provide an arithmetical check: we repeat the procedure with 
(a: i) instead of x. We should find ZfJ^x -k 1) = Zf^ x -f- Zf^ = 

+ «; ZfJ^x + if = Zf^ x^ + 2Zf^ X -\-Zf^= + zT* + n, and 

in fact we do. 

20.6 True mean and variance of a binomial distribution 

In most distributions which occur in practice the only way of finding 
the true mean and variance accurately is the empirical one of taking a 
large sample, and finding its mean and variance. Thus there are (at 
least in our present state of ignorance) no ways of predicting the number 
of bristles to be found on a Drosophila, the number of birds inhabiting 
a given area or the number of young in a litter of mice; these have to be 
discovered by observation. But there are a few distributions where such 
a prediction is possible. The most important of these is the “binomial” 
distribution. 

This is the distribution of .v. the number of times an event occurs in 
a fixed number N of trials, when the probability of its occurrence in 
any one trial is p. We have already had an example in which a coin is 
tossed A' — 3 times, and .v is the number of heads. Here p, the proba¬ 
bility of heads in any one trial, is i. There are also many genetical appli¬ 
cations. For example we can mate two Yy pea plants, and count the 
number .v of green (yy) peas out of every ten picked. Here N = 10, 
and the probability p that any one pea shall be green is If we take a 
large number of groups of ten peas we shall obtain varying numbers of 
green peas, and these numbers will follow the binomial distribution. 

Now we know that if the probability of an event occurring in one 
trial is p, then the probability px of it occurring .r times in N trials is 
\N p^q^-^j'x \N—x, where q=i—p (Section 19.12). The mean 

number of occmrences is therefore (formula 20.2) 

^ 1 = C\ = Z xp^ 

= opQ + ^Pi “!" -p2 + • • • + A^/>7v 

Now this series can be summed as follows. The first term is zero, and 
the sum of the following terms is (using the relation |.v — .vjar ~ i) 
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fj. = 


1 i_ - P ■ . 2 . 

'— ( jl [A'' — I |2 jA^ — 2 


N .p^ .qO) 

• • I “• 


= Mijsj—j * J 


,N-l 


NiN-ip\ - 

'- I |o 1^ — I 


+ 


P 9 


N-2 


jl |A'’ — 2 

T 2 


+ . . . + 


Np ) ijV-zipy-^ ^ p'q 
I 1° 1-^^ — I 




+ . . . + 


fYi° ' 

p.V-1 gO I 

| A^' — I 1^1 

I p^-iqO 
I A'—I jo 


But the expression within the brace brackets is simply the binomial 
expansion of (q + />)^■^ and is therefore equal to i. Thus fj. = A’p. 

This result would indeed be expected from the definition of the 
probability as the proportion of cases in which the event occurs. For if 
it occurs in a proportion p of cases, one would expect it to occur, on the 
average, in Np cases out of every N. 

The formula for the variance is not so immediately obvious, and in 
order to derive it we need the sum of the series A = 2 ^ x(x — i)pg 

= 2.i./)2 + 3*2‘^3 + ••• T A'(A^ — i)Av- Now ix = 

x^x — I = x{x — i) |x — 2, and therefore ~ 


A = 


_ 2.1.p^q"^-^ x{x—l)^IV p’^q^"^ 

.— 4 ”, , , 4 “ — 


2 |N —2 

x,.V-2 


= r 


\N 


4 - ... 4 " - 


X |A^—AT 

px-2 qN-x 


+ • • . 4 " — 


N{N~i)\S p>'’q^ 


iN |o 


|a:— 2 jA^—A' 4- 2 


- 4 - . . . 4 - — 


..V-2 


N —2 IO 


= N{N- i)p^{q 
= N{N-i)p^ . 

(A similar argument shows that 


( 20 . 8 ) 


^PxX{x — l){x ~ 2) == N {N — l)(A^ — 2)/)® 

2 :p^x{x — i){x ~ 2){x ~ 3) = A'(A^ - I)(A^ _ 2) (N - 3) A (20.9) 
and so on.) 


Now by (20.6), 

V = Z pj^x — fi-Y 

= Z — 2 tix 4- 

= Zp^[x{x — i) 4 - a:(i — 2^) 4- A“] 
=^Zp^(x — 1) 4- (i _ 2/x)Z’/>^V -j- fx^Zp^ 

= A 4- (1 — 2 fx)fj. 4- ^2 I 

= N(N - i)p2 4- (i - 2Np)Np 4- {NpY 

= — 7 Vp 2 4- Np 

= Np{i —p)^ Npq. 

Thus the true mean and variance are respectively 

= Np, V = Npq 


. (20.10) 
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This expression Npq for the variance is not surprising. For if an event 
has probability p = ooi occurring, it will always occur exactly o times 
out of every N. Thus x is fixed at the value o, and has no variability—as 
is shown by the value v = Npq = o. Similarly if /> = 1,^ = 0, the 
number x of occurrences of the event in trials must always be exactly 

and cannot vary. Again we find v = Npq — o. But with values 
of p lying between o and i the number of successes in N trials will no 
longer be fixed, but may vary from one set of N trials to the next. The 
variance v — Npq is then positive, since iV, p, and q — i — p are all 
necessarily positive. 

20.7 One>variable continuous distributions 

Clearly by measuring the height x of every person in London we 
can find a certain distribution of heights. But in contrast to the case 
in which x represented the number of heads obtained by the tossing of a 
coin, here x is not limited to a finite number of distinct values but can 
take any value in a continuous range. Our definitions must therefore 
be modified. 

It is no longer useful to speak of the probability/)* that the height of 
a person is exactly x: for it is most unlikely that any person will be 
found to have any given height, such as 17000000 metres, assuming 
complete accuracy of measurement. So />* is zero, or at any rate will 
not be appreciably different from zero. But we can imagine that if we 
specify two heights, say .Vj and .Vg, there will be a certain probability 
that a person chosen at random will have a height benveen Xi and x^. 
Thus in the adult male British population, about 53 per cent have 
heights between 1 70 metres and i -So metres, and this is equivalent to 
saying that a man chosen at random has a probability -53 of having a 
height in this range. 

It is useful to define P* to be the probability of having a height, or 
other measured quantity, not exceeding x. This will be called the 
“cumulative distribution function*’ P*: its definition is identical with 
that of P* for a discontinuous variable. In general it is not possible to 
predict its value on theoretical grounds. But if we take a sample of n 
individuals, and find that P* of them have heights not exceeding a?, 
then Pj./« is the proportion of such individuals in the sample, and this 
forms an estimate of P*. The larger the sample number «, the more 
nearly this will approximate to the true value P*. Note also that, since 
the probability of obtaining a height .v exactly is negligible, we can 
interpret P* alternatively as the probability of obtaining a height 
less than .v (this being practically equivalent to a height not greater 
than .v). 

Once the function P* has been found all the other properties of the 
distribution can be deduced. ForexampIe,letA:iand.V2beanytwogiven 
heights with .v.^ > Xi. Then P*^ is by definition the chance of obtaining 
a height less than .Vg, and by the addition law of probabilities this is the 
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sum of the chances of obtaining a height less than and of obtaining 
one between a:, and JCg. 

= Px, + Prob. of height between a:, and .V 2 , whence by sub¬ 
traction the chance of a value between jf, and .Vg must be . 

We can if we wish represent the distribution by drawing the cur\'e of 
the function jy = P^. This is shown in the lower half of Fig. 20.2: the 



Fig. 20.Z — The probability density ^(x) and cumulative probability Px of 

a (normal) distribution 

chance of a value lying between x, and a- 2 will be the difference between 
the y values for these two values of .v. 

But this theoretically convenient method of representation has some 
practical disadvantages. We can suppose that there is some lower 
bound // to the height of a man, below which adult life is impossible; so 
that when .v is equal to or less than h, P^ is zero. We can also suppose 
that Uiere is a maximum possible height H; since the actual height is 
certainly less than //, this means that P^ = i whenever .v > IP So the 
graph of P^ will necessarily be of the following form: it will remain at 
zero for all values of .v up to h, it will then climb steadily upwards until 
.T= //, and thereafter remain at the value i. But, unfortunately, 
within ver>' wide limits all graphs of this type tend to look rather similar 
to the eye. They will indeed have different shapes, but these will not be 
strikingly different. So it is usual to adopt another method of represen¬ 
tation. 

This alternative method is based on the histogram of the grouped 
distribution. As an illustration we shall take some data on the duration 
of pregnancy analysed by M. N. Karn and L. S. Penrose (Ann. Ptwen. 
Lond.^ 16 (1951), 148-164) as part of an investigation on the relation 
between the birth weight of a baby, its gestation time, and its chance of 
survival. I he data in question included the gestation time for 6419 
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normal female births—more precisely, for the 6419 non-twin female 
children born in University College (Obstetric) Hospital in the years 
1935 to 1946 who survived at least twenty-eight days. Now clearly it 
would be unnecessarily laborious to write out all the 6419 gestation 
times to specify the distribution. It is sufficient for ordinary purposes 
to divide the range of gestation times into a number of smaller sub¬ 
ranges or “intervals”, and specify the number of cases falling in each 
interval. Thus it was found that, among the 6419 births considered, 
there were three having gestation time between 200 and 209 days (in¬ 
clusive), 14 between 210 and 219 days, 25 between 220 and 229 days, 
and so on. In effect we have thus reduced the continuous distribution 
of gestation times to a discontinuous distribution by a process of “group- 
ing”. 

This grouped distribution is shown graphically in Fig. 20.3. Here 
the scale of .v, the gestation time, is marked along a horizontal line; and 



Fig. 20.3—Histogram of gestation>time distribution, with a normal 

curve superimposed for comparison 

this line is surmounted by rectangles. The area of any rectangle repre¬ 
sents the number of values of .v found within the corresponding range. 
For example the first rectangle, covering the values of a: from 200 to 
209 days inclusive, will have an area equal to 3 in suitable units—almost 
negligible in a sample of 6419. The next will have an area of 14 units— 
still very small: the third inter\'^al, from 220 to 229 days inclusive, an 
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area 25, and so on. As will be seen from the figure, we obtain in this 
way a bell-shaped histogram very similar in form to that of the binomial 
distribution (see Fig. 19.4). 

Experience, combined with certain theoretical considerations, shows 
that the most satisfactory results are obtained when the whole range of 
the distribution is covered by between 10 and 25 intervals or “groups”. 
If too few groups are taken, the effect will be to obscure the general 
form of the distribution and to introduce inaccuracy into the calculations. 
If there are too many groups there will be so few cases in each interval 
that random fluctuations in the frequencies will giv'e a v^erv' irregular 
shape to the histogram: and also the work of calculation and tabulation 
will be greatly increased. 

As a rule too it will be most conv’enient to use equal intervals of 
grouping, as we have done above. Sometimes it may be necessary to use 
unequal intervals, though this should be avoided whenever possible. 
There is then no difficulty in drawing the corresponding histogram, but 
it is important to keep in mind that it is the areas and not the heights of 
the rectangles which represent the frequencies, 'rhe reason for this rule 
is that a wider interval of grouping necessarily gives a larger frequency, 
quite apart from other considerations; and it seems unfair that any 
rectangle of the histogram should have its height increased simply 
because its base is widened, as would be the case if the frequency was 
proportional to the height. Hut if the frequency is represented by the 
area, we can deduce a simple formula for the height. Suppose that the 
rectangle in question is of height h, and lies between the values .v, and 
Xi of .V. Its area must therefi're be h{x., — .v,) — // 5 .v (say). This area 
is equal to the number of values of .v it^ the sample lying between .v, and 
X2, and that is —- hF, say. (I'or there are F,. v'alues smaller 

than X2, and of these the F,, smaller than .v, do not lie between .Vj and 
jfg.) 'I’hus h . Sx = &F, or h — BF/Hx. 

1 his formula will hold if we make the areas represent the absolute 
frequencies. But it is of course equally possible to arrange the histo¬ 
gram to represent the relative frequencies or proportions in the satnple; 
it is enough to divide by «, the sample number, or alternatively, without 
changing the histogram itself, the unit of area can be multiplied by n. 
'I'he formula for the height of the rectangle will then be 

h = «-> . SF/Bx . . . (20.11) 

Now the histogram shown in Fig. 20.3 has a fairly regular outline, 
shaped rather like a bell, as we have already remarked. I'his outline is 
broken up into a series of steps; hut it seems plausil)le that if we take 
narrower and narrower grouping intervals the steps will become less 
noticeable, and the tops of the rectangles will approach a smooth curve. 
Such a curve is suggested in Fig. 20.3. Naturally it will be necessary 
to increase the sample size n as the intervals become narrower, for 
otherwise we should come to a point at which the numbers falling in 
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the groups would be too small to give a regular outline. The limiting 
curve obtained in this way is called the graph of the “probability den¬ 
sity” or “distribution function” y — [The nomenclature and 

notation differ slightly in different books, and sometimes the symbols 
f(x) or F(.v) are used instead of From equation (20.11) it is pos¬ 

sible to find a relation benveen and <f>{x). (We suppose that the areas 
of the histogram represent relative frequencies or proportions, rather 
than absolute frequencies: this means that we can increase the sample 
- number n without greatly altering the shape or size of the diagram.) 
Now when n becomes large, i.e. the fraction of the whole sample 

falling between the values Xi and a : 2 , will approach SP = — P*^, 

i.e. the probability that a value of x chosen at random will lie between 
Xi and a: 2 - Thus for a sufficiently large n the height h cx SPjSx. If we 
now allow the interval wdth S.v to tend to zero the quotient SPjBx will 
tend to the derivative dP/dx = while the height h will tend to the 

ordinate y ~ ^(.v) of the smooth distribution curve. Thus 

y = ^(.v) = P^Pa- . . . (20.12) 

^(.v) is simply the derivative of the cumulative probability function P*. 

What is the meaning of this function ^(.v)? To say that it is the limit 
of SP/5.V as Sa: o is to say that it is practically equal to hP/ 8 x when 

Sa.’ is sufficiently small, or SP ^(.v)Sa:. That is, the probability of a 
value of a : falling inside a small interval of length S.v is approximately 
(^(a;)8.x = Alternatively the relation can be expressed in integral 

form. Since «;6(x) = P*P*, we have 

^(.v) dx = P,^ — P^, . . (20.13) 

But we have already shown that P^^ — P,, is the probability of lying 
between .v, and X2, or the proportion of the population between these 
values. Equation (20.13) shows that this is equal to the area under the 
curve ^(.v) between the ordinates at .Vj and x^, as shoAvn in Fig. 20.2. 
This is indeed natural, since the histogram itself represents proportions 
by areas, and so one would expect the area under the limiting cur\^e 
y =- (f>{x) to represent a proportion. This relation also shows that the 
total area under the distribution cun’e must be i, since it represents a 
fraction i/i of the w^hole distribution. 

^(.v) dx = I . . . (20.14) 

We may note that in order to introduce a distribution curvejy = ^(.v) 
no less than three idealizations have had to be made. The measured 
quantity x has been treated as continuously variable, and for this it has 
been necessary to suppose that there is no limit to the accuracy of 
measurement, so that v can be considered as a real number or unending 
decimal. Secondly the probability P^. has had to be defined as a pro¬ 
portion in an infinite sample. And thirdly we have brought in the 
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derivative of the probability, defined as the limit of SP/ 8 x as 8 x tends to 
zero. Now strictly speaking we cannot measure with more than a cer¬ 
tain accuracy, we cannot take more than a certain size of sample before 
the experimental conditions are noticeably altered, and we cannot in 
many cases let the interval 8 x of measurement tend to zero because of 
the atomic structure of matter. But, for all that, the idea of a proba¬ 
bility density or distribution curve is of the greatest practical use. 


Table 20.2—Calculation of mean and standard deviation of 

gestation time 


Time x 

Frequency 

Code 

1 

1 

f 

(days) 

fx 

X 

UX 

200-209 

3 

—7 

—21 

210-219 

*4 

-6 

-84 

220-229 

25 

-5 

— 125 

230-239 

60 

-4 

—240 

240-249 

98 

-3 

-294 

250-259 

234 

—2 

—468 

260—269 

547 

— I 

-547 


*567 

1 

— *779 

270-279 

0 

1 

280-289 

2250 

I 

2250 

290-299 

1114 

2 

2228 

300-309 

325 

3 

975 1 

310-319 

109 

4 

43 *> 

320-329 

44 

5 

220 

330-339 

22 

6 

132 

340-349 

4 

7 

28 

350-359 

3 

8 

24 


n = 1 


6293 

'I’OTAL 

6419 , 

1 

! 

= 45*4 


fxX* 


H 7 

504 

625 

960 

882 

936 

547 


2250 

4456 

2925 

*744 
1100 

792 

196 

192 


T _ 

j 2 — 


18256 


x = 274 5 {- loA' 

^ == 'i'xin = 4514/6419 = 703225 
X = 2745 + 10^ == 2745 + 703 == 281-53 

/ "" Zl ~ V = *^256 — 4514 X 703225 = 15082 

) — •b/(« 1) 15 — 2 267 [sample variance of JC\ 

= 226-7 [sample variance of .rl 

s = V^(^) =1506 ’ 
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20.8 Mean and variance of a continuous distribution 

The mean and variance of a sample from a continuous population 
can be found by the methods explained in Sections 20.4 and 20.5: they 
will be exactly the same as for a discontinuous population. However if 
the sample is very large it would be too laborious to set out all the 
separate sample values. But the grouped distribution can be treated 
by the same method as used for a discontinuous distribution in Section 
20.5. As an illustration we shall take the data on time of gestation 
shown in the histogram of Fig. 20.3, and specified more accurately in 
the table on p. 585. 

Explanation of calculation 

The first column shows the grouping intervals, with the gestation 
times specified to the nearest day. A little care is usually needed to 
specify these intervals accurately. Here, if we assume that a stated time 
of 200 days really means anything between i99'5 and 200*5 <^ays. and 
one of 209 days anything between 208 5 and 209*5, group should 

properly be from 199*5 209*5, with mid-point at 204*5 days. The next 

group will have mid-point 214*5 days and so on. 

Strictly speaking even this argument is not quite accurate, since the 
gestation time will usually be calculated by subtracting the supposed 
date of conception from the date of birth. This subtraction by itself 
may introduce an error up to one day; and in addition the true date of 
conception will be rather uncertain. Some correction for this uncer¬ 
tainty was made in the original paper by Karn and Penrose, but we 
shall not concern ourselves with these additional complications here. 

Now the mean and variance of this distribution can be obtained by 
treating the observations as if they were concentrated at the mid-points 
of the group intervals; e.g. we treat the three values lying between 200 
and 209 (or more strictly bet^veen 199*5 209*5) if they were all 

equal to 204*5. fourteen values between 210 and 219 will all be 

considered as 214*5. The total sum of all the observational values, 
will therefore be 3 x 204*5 -f 14 X 214*5 + • • • + 3 X 354 ’ 5 » 
mean x will be TJti = 281*53. the arithmetical labour can be 

greatly reduced by a device known as “coding”. This amounts to no 
more than a change of scale and origin of measurement, whereby the 
variable .v is replaced by a more convenient variable X, taking the values 
—7, —6, —5, etc. at the mid-points of the grouping intei^^als, x = 
204*5, 214*5, 224*5, • • • respectively. This “code” is so chosen that 
X = o corresponds to the mid-point of an inter\"al near the centre of 
the distribution—here to .v = 274*5. "^he relation between x and X is 
therefore 

X = 274*5 + 

since a change of X by i corresponds to a change of to in .v. We now 
proceed by first finding the mean and variance of X. The total of all 



DISTRIBUTIONS 


507 

values of X is the sum of all products of the form i.e. 3 x (—7) 

+ 14 X ( 6) +...+3x8. In order to facilitate this calculation 

the values of/^ are entered in the fourth column: they are obtained of 
course by multiplication of the entries in the second and third columns. 
The total of all these entries gives Tx = 4514. Some computers prefer 
to add the negative and positive products separately, and then combine 
them, as we have done in our table: the negative products sum to 
— 1779, and the positive ones to 6293. and —1779 + 6293 — 4514. 
The mean X = Txin in the usual way; and from this we can get the 
mean x o( x (which is what we really require) by using the relation be¬ 
tween a: and Xi x = 274 5 + 10 X = 281-53. 

The “crude sum of squares” for X is similarly calculated by 
summing all values of/^AT^; these are given in the last column. We can 
again obtain the “deviance” S = T^ — Tx^ and the variance of X = 
Sl{ti — i) = 2-350. At this point in the computation it is usual to sub¬ 
tract yV from the variance: this is known as “Sheppard’s correction”, 
and allows for the inaccuracy due to the grouping of the distribution! 
The corrected variance is accordingly 2-350 — 083 = 2-267 in terms 
of the variable X. Since i unit of X is equal to 10 units of .v, and the 
variance is defined (effectively) as the mean square deviation, it follows 
that the variance of .v = io“ x the variance of AT = 226 7. The esti¬ 
mated standard deviation s is therefore \/22('> j = 15 06. 

Now we can expect the sample mean x and the sample variance to 
tend to definite limits when the sample number n is increased indefin¬ 
itely and the width of the group interval is made to tend to zero. But 
the mean x is defined as 2 : xfjn, where /, is the number of cases in the 

group interval centred at a'. Also as tends to the proba¬ 

bility,/)^, say, of lying in this interval: and when this interval is narrow 
(or width Sx say) we know that approximates to ^{x) Sx, and so x 
121 Zx<f>{x) hx, summed over all the group intervals. As the widest 

group interval tends to zero this sum tends to the integral /"* Xif>{x) dx 

taken over the whole range of values of a, where a denotes the least 
possible value and b the greatest. We can slightly simplify this formula, 
if we wish, by the following trick. Instead of taking the range to be from 
a to b, we take it to be from —co to co, and define <^(.v) to be zero for 
those values of a which never occur, i.e. those outside the true range 
from a to b, so that these values do not contribute anything to the 
integral. 'I’hus the limiting value of the sample mean is 

^ a^(a) dx = x 4 .{x) dx . . . (20.15) 

and this is accordingly called the “true mean” or “expected value” 
Ca of A corresponding to the distribution function 0 (a). 

This may be compared with formula (20.2). Exp^ which 
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defines the true mean for a discontinuous distribution. The essential 
change is that the probability of taking the value x is replaced by the 
probability 8 x of falling in the small interval of length Bx; the sum 

2 ^ x<l>(x) Sjc is then replaced by the integral J dx. In the same 

way the expression (20.6) 

V — SpJjx — uY for the true variance becomes 

V = {x — dx 

J a 

= f (a:— ^Y(l>{x) dx . . . (20.16) 

J ^00 


for a continuous distribution. This can also be written 

V = f — 2/u: + fi^)<^(x) dx 

= f dx — 2^ f .v^(a*) dx + /" dx 

J —00 J —00 J —00 

.v^^(.x) dx — 2/x . /i + /X* . I 



[by (20.14) (20.15)] 



X‘ 4 >(x) dx — /i® 


(20.17) 


The true standard deviation <7 of the distribution is defined as the square 
root of the true variance. It is not difficult to show that in large samples 
the sample variance v = will approximate to the true variance v — a- 
provided that a sufficiently fine interval of grouping is used, or provided 
that it is calculated directly from the formula v — 2 ^ (.Va — xYI{n — i) 
without any grouping of the observed values. 


EXAMPLES • 

(i) A variable x is restricted to values between o and i, and its 
distribution function between these values is «^(.v) = i. Show that this 
is a possible distribution function, and find the true mean and variance. 
(This “rectangular distribution” can be loosely described by saying that 
all values between o and i are equally probable.) 

In order that <f>{x) should represent a distribution function it is 
clearly necessary that it should satisfy two conditions: it must be 
positive, and its integral over the whole range must be i. Now here 

<^(.v) ^ I > o, and ^(.v) dx = y*^ i dx = [.v]^ = i. 

(We can here restrict our integrals to the range o to i. Or alternatively 
we can integrate from — 00 to co by the device of defining ^(.v) to be o 
outside this range. In any integral containing ^(.v) as a factor, all 
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integration outside this range will give a zero contribution and can be 
neglected.) 

By definition the true mean is 


and by (20.17) the true variance is 

f\ [J-v®]; — i = i i = tV. 


(2) A variable x is restricted to positive values and has distribution 
function <f>{x) = (where <2 is a positive constant). Verify that 

this is in fact a distribution function, and find the true mean and 
variance. (This is the so-called “exponential distribution”. It is the 
form of the distribution of the time of decay of an atom of a radioactive 
element, and also is an approximate representation of certain “waiting 
time” distributions—e.g. a: might be the time spent in waiting for a bus 
or a telephone call. Presumably x might equally well be the time be¬ 
tween the opening of a flower and its pollination by an insect.) 

We have (^(x) > o and 


^(x) dx = 






so that <^(x) does in fact represent a distribution function. Now we 
know that 


y dx = — xe~°^ja 

J xV“®*(/.v = —2e““*/a® — 2xe-°^ja^ — x*e“®'/a 


(by (*5-5)> o*" hy direct diflPercntiation of the right-hand side) and there¬ 
fore 

^ — J^ xae-'*^ dx = [—— axe~®'/o]" = ija 

/ oe 

x^oe“®* dx — u* 

u 

= [— 2ae~’‘^ja'-' — zaxe-'^^ja'^ — — i/a^ 

— 2ja^ — ija^ — \la^ 

a — -s/v = ija 


(3) A variable x is unrestricted in value, and has the distribution 
function (zrr)-* e-^'\ Show that this is possible, and find its mean and 
variance. (This is the standard form of the “Gaussian” or “normal” 
distribution, which we shall consider later in more detail.) 


In order to evaluate the integrals it is convenient to split the range of 
variation into two parts, the negative part from —00 to o, and the 
positive part from o to 00. Consider first the positive part; here we 
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change the variable from x to u= so that x 
formula (ig.i6), e-^ x^ dx — |m, where |- 

(See Section 19.16.) So 

J <j>{x) dx = (Ztt)-* J dx 


=: (21/)*. We also use 
-J = TT*, 10 = I and 


0 

«0 


= ( 27 r)”* J e““ Z>tp);. du 
~ ( 27 r)“^ J . 2“* du 

= (27r)-* . 2-* , | —i i 

:c«^(a:) (/a* = ( 27 r)-* (zu)^ . e-“ . D^a . du 


0 

00 




0 

.00 


= (277)** du = ( 27 r)*^» 


0 

.00 


J aVG^) == (27r)-* J^ ZU . du 

= (27r)-t.2^ f 


For the range from — co to o we substitute again u = ^a*, or a— 
—(2m)*, since x is negative. A similar calculation sho'ws that 

^ 4>(x) dx = ^ x4>{x) dx = —( 27 t )‘, a 2 ^( a ) dx= \ 

Addition of the integrals over these two ranges gives 


/** <^(a) dAr=^ + i= i,as should be; 

J ^00 


V 



A«^(.v) dx = (Ztt)^ — ( 27 r)* = o; 
a 2 .^(a) dx — o — 


I 


a 



20.9 Change of variable 

In the calculation of the mean and variance of a sample we used a 
device of coding, i.e. of changing the origin and scale of measurement 
in such a wzy that the calculations are simplified. It is worth while 
writing down the relations in general form. Suppose that X is the value 
of a measurement on one scale, and a on another, so that the relation 
between them is 

a = ^+-BA' . . . (20.18) 

where A and B are constants and B is positive. Thus if X represented 
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the temperature measured in degrees centigrade, and x the corres¬ 
ponding value on the Fahrenheit scale, the relation would be x = 
32 4- i SX. Now if we have a sample consisting of a set of values 
■^1* •^2» • • • Xfi of X, equation (20.18) will give the corresponding values 
of Xy say x^y X2, • • . x^. There will therefore be a mean, X say, a variance 
v{X) and a standard deviation r(A') for the variable X. We can calculate 
these values by the procedure given above. There will also be corres¬ 
ponding values of a:, say and s(x) respectively. We assert that 

these can be obtained from X, v(X)y s(X) by the relations 

x = A-bBX; v{x) = . v{X); s{x) = B . s{X) . (20.19) 

Th^e relations follow from the general argument we used in discussing 
coding. They can also be proved directly from the definitions: we shall 
give the proof for the mean, leaving the (more complicated but straight¬ 
forward) proofs for the variance and standard deviation to the reader. 
We have 

X = (total of A:)/(sample number) 

= 2^ Xajn 

= Z {A + BXa)lri 
= {An + B ZXa)ln = A -h BX. 

Now in a very large sample the sample mean, variance, and standard 
deviation will tend to the true mean, variance, and standard deviation 
respectively. Thus by taking a large sample we obtain from (20.19) the 
corresponding relations between true values 

lx=^A^BLXy v{x) = B^~ . v{X)', g{x) = B.g{X) . (20.20) 

where v{x)y v{X) denote the true variances of x and X respectively, and 
*^{X) the standard deviations. 

It remains to find the relation between the distribution functions 
say <P(X) of the variable X, and ^x) of a-. Now we know that if X^ and 
X2~ Xi SA'i are two neighbouring values of the variable, the chance 
of finding a value of X between them w ill be nearly 0 (Xi) 8 Xi: for at the 
corresponding chance will be si(A:i) Sx^. But X and a- are simply two 
different ways of measuring the same quantity, and so these chances 
must be equal: that is 

<P{Xi) SXj = ^(a*i) Sa:,. 

Now Sx. = x^-x, = {A + liX,) -{A + BX,) = B(X. - X,) = 
BhX^: and therefore on division by the equation gives ^{Xi) = 
B 4 »{x^). But Xy is really any value of X we choose, and .v, the corres¬ 
ponding value of at; so we can drop the suffix and write finally 

m=^{X)IB . . (20.21) 

Note .—This is really only a special case of a general result which 
states that if X is any variable, and x = ^{X) is related to by a given 
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function ^ in such a way that each value of X within its range of distri¬ 
bution gives one and only one corresponding value of x, then the distri¬ 
bution functions are related by the equation = <P(X)l\Dxx\. How¬ 
ever, we shall not require this general result here. 

EXAMPLE 

(i) A sample taken by K. Pearson from the British population of 
adult males gave the following results for the height X measured in 
inches: mean X = 68*64, variance v{X) — 7*30, standard deviation 
s{X) = 2*70. What are the corresponding values for the height x 
measured in centimetres? 

The relation is x = 2*54^', i.e. A ~ o, B = 2*54, and therefore 

S = 2-54X = 174-35 
= 2-54- v(X) = 47*10 
s(.v) = 2*54 ^(A') = 6-86 


20.10 The Gaussian or normal distribution 

We have already noted that many distributions met with in practice 
tend to have a bell-shaped distribution cur\e. This means that the 
observed values tend to cluster round a central value, while the proba¬ 
bility decreases rapidly on each side, but usually without any absolutely 
sharp ends beyond which no values can occur. 

Now a typical distribution of this type is that with probability den¬ 
sity ^(X) = (277)-! We have already considered such a distri- 

bition in Example 3 of Section 20.8, and have shown that the mean 
CX of A' is o and the standard deviation <5(A’) is 1. Suppose we make a 
change of origin and scale, say by the relation x = A -r BX. The effect 
will be to have a new distribution curve of similar form but displaced 
so as to have a new mean fx (say) = Cv and a new standard deviation 
rs = c(a*). But from (20.20) we obtain 

Lx = A -j-B LX = A 
a = a(.v) Bo{X) = B 

so that the connecting relation .v = A BX can be written 

X = -r aA; X = (.v* — ^)/(t . . (20.22) 

From equation (20.21) we can also write down the distribution function 
for X 

^(.v) = 0 {X)IB 

= { 27 T)-i 

= 0-1 (2ir)-^ . (20.23) 

The general form is shown by the curve in Fig. 20.2, p. 581. It is a 
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as we can show by direct differentiation 
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= o in Fig. 20.2), 


D^<f>{x) =: (27r)-t [ — (^x — 

whence D^<f>{x) can only be o when .v — = o, or .v = since all the 

Other factors are non-zero. Also by a second differentiation 

4>{X) = (27r)-* f(.V _ ^)2 — 

and since all other factors are positive we see that D^'^ 4 {x) must have the 
same sign as (x — fi)~ — a-. So D,.-<l>{x) is negative, and the curve is 
concave downward, if (.v — fj.)- — a- is negative, i.e., if (.v — ^)- < <y-, 
or on taking square roots, if |.v — < <t. That is, the downward 

curvature extends to a distance a on each side the mean; at a further 
distance away the curvature is concave upwards. We can thus identify 
the standard deviation of a curve of this particular shape as the distance 

from the mean to a “point of inflexion”, a point at which the curvature 
changes sign. 

A distribution of this form is known as a “Gaussian” or “normal” 
distribution. It is of great importance in statistics for several reasons: 
these may be collectively summarized in the assertion that a great num¬ 
ber of distributions which turn up in practice are approximately Gaus¬ 
sian, and sometimes they are quite surprisingly good approximations. 
Thus a normal distribution curve is superimposed on the histogram of 
the gestation-time distribution in Fig. 20.3, p. 582: it will be seen that 
there is fair agreement. 'I'he distribution of heights in the adult popu¬ 
lation and the distribution of weights are also nearly normal. 

Why should this be? Why should many naturally occurring distri¬ 
butions approximate to a single definite shape with the rather special 
and complicated form of function (20.23)? 'I'lie answer appears to be in a 
very general result which states that (under certain conditions which we 
shall not discuss here) if the variation in a measured quantity is due to a 
large number of small effects added together, tiien its distribution is 
practically normal. Thus we can imagine that the height of a person is 
affected by a large number of genes, each one of which contributes a 
certain small amount to the height; it is also coticeivable that a number 
of environmental effects may also play a part. The same may be true of 
gestation time, and of other characters which have a normal distribution. 

In fact this seems to have impressed the older school of statisticians to 
such an extent that they decided to call the distribution normal, meaning 
thereby that it was the usual and proper form for it to have. But now it 
IS recognized that this term is somewhat misleading, as there is nothing 
abnormal” or pathological in a deviation from normality in this tech¬ 
nical sense: the so-called “normal” distribution is important, but it is 
by no means the only possible form, 'fhere is therefore a modern ten¬ 
dency to call it the “Gaussian” rather than the “normal” form to avoid 
QQ 
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this unfortunate connotation, but the word normal is very well estab¬ 
lished and the reader must be prepared to accept this special technical 
meaning. 

In a normal distribution it is easy to calculate the probability of 
obtaining a value within any given range, as in the following example. 


EXAMPLE 


(i) Assuming that the heights of adult males in Britain have a nor¬ 
mal distribution with mean 174 35 standard deviation 6-86 cm, 

what is the chance that a man selected at random will have a height x 
between 170 and 180 cm? 

It is convenient to make a change in origin and scale of measurement 
so as to use the variable X = (.v — i74-35)/6*86, which accordingly by 
(20.22) has zero mean, unit standard deviation, and distribution func¬ 
tion ^(X) — (277)-^ The values 170 and 180 of x correspond to 

values —*634 and +-824 respectively of A", and the required probability 
is therefore 

P= dX 

-.834 


= (277)-! (277)-* 

—00 -00 


Now uiifortunately these integrals cannot be expressed in any simple 
form. But the function f (277)“* which is denoted in the old 

edition of Tables for Statisticians and Biometricians (Biometrikay Univ. 
Coll., Lcndcn) by -i[i -r a(A’)] and in the revised edition by P(X)y is 
tabulated in the Appendix (Table 4). From this table we find P = 
^(•824) — P( —-634) — -795 — -263 = *532. Thus 53-2 per cent of the 
population have heights between 170 and 180 cm. 

This function A[i -f- “(AT)], or F(A), is accordingly the proportion or 
percentage of the area under the standardized form of the normal curve 
to the leh oi the value A'. Since the total area under the cur\'e is i, it 
follows that the area to the right of .Y is i — P{X) = i — ^[i + o(.Y)] 
= — a(A’)]. Furthermore since the curve is symmetrical about the 

mean o, it follows that this must also be equal to the area to the left of 
(—A). If A' is positive this implies that the sum of the two areas, that 
to the left of (~X) and that to the right of A', is altogether 

[i - P{X)] + [I - P(A')] = 2 - 2P(Ar); 

expressed in another way this is the chance of a deviation from the mean 
o exceeding A' in absolute value. For example the chance of a positive 
or negative deviation greater than 2 in absolute value is 2 — 2^(2) = 

2 — 2 X -9772 (from the tables) = *0456. But by definition X = 
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(x — I.e. X is the deviation of the value a: from the mean ex- 

pr^sed in terms of the standard deviation as unit. So the chance of 

finding a value, in any normal distribution, which differs from the 

mean by more than twice the standard deviation g, is or approxi¬ 

mately I m 22. 

20.11 The normal approximation to the binomial 

We have already seen that if an event has a chance p of happening in 
experiment, then the chance of it happening x times in all 
^ happening in the remaining — x trials is = 

\Np^q^~^/\x \ N ~ X, where g = (i —/>). Fig. 19.4 p. 561 shows the 
histogram of the chances of getting .v heads in = 8 throws of a coin 
^ — i), and Fig. 19.5 that of getting x dominants in iV = 8 offspring 
from ^ “intercross” mating {p = ^). We have also shown in Section 
20.6 that the mean value of .v is /i = A>, and the standard deviation is 

a = Vi^pg). 

Now when N and jc are large the exact formula for p^ is rather 
troublesome to work out, and we have established in Section ig 11 the 
approximate formula 

px ( 27 rAp^)“* 

= ( 27 r)-* CT-* 

v\Jich is exactly the same as formula (20.23); ' c. the probability of 
obtaining a; successes in the binomial distribution is very nearly equal 
to the ordinate of the normal distribution curve of equal mean = .\p 
and equal standard deviation a = Vi^Pg)- This is illustrated in Figs. 
19.4 and 19.5, where the normal curve is drawn on top of the histogram. 
But really we arc not so much interested in the ordinates of the curve as 
m the area under the curve. We can readily translate our formula into one 
concerning an area by observing that in say Fig. 19.4 the average height 
of the curve between the values x -- 1-5 and x = 2 5 is nearly equal to 
the ordinate at x '= 2; and therefore the area under the curv'e between 
•» = 1-5 and x -= 2 5 is also nearly equal to this ordinate. Similarly the 
ordinate at x = 3 will be nearly equal to the area between x = 2-5 and 
^ — 3 ' 5 . and the ordinate at x = 4 is very nearly equal to the area 
between 3 5 and 4*5. So the total probability of obtaining 2, 3 or 4 heads 
f IT 8 coins is very nearly equal to the sum of the ortlinales 

of the corresponding normal curve at x = 2, 3, and 4, and tins in turn 
18 very nearly equal to the sum of the corresponding areas under the 
cu^e between x = 1*5 and 2*5, between x ^ 2-5 and 3-5, imd between 
^ — 3*5 aod 4 5, i.e. to the total area between .v ^1-5 and 4 5. Fx- 
pressing the same argument in general terms we see that 

“the probability of obtaining in N trials any number of successes 
between x, and inclusive is approximately equal to the area 
between the ordinates at x, — ^ and i under the normal 
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curve of mean /x = Np and standard deviation a — V{^P 9 )y i-C* 
it is very nearly equal to 

(x^±±^p\ _ ( x, -i^Np y. 

\ V Npq ) \ V Npq ) 

This expression can be fairly readily found from tables of the normal 
integral P{X). 

This expression may seem a little complicated at first, but it is very 
much simpler than the direct computation and summation of the proba¬ 
bilities, and it is also quite surprisingly accurate if N is even moderately 
large. 


EXAMPLES 

(1) The chance of obtaining 2, 3, or 4 heads in a toss of 8 coins 

is thus approximately given by .Vj = 2, = 4, A'’ = 8, 

q^i -p = = V(Npq) = ^2. 

P =. ^ f Qi^) 

= P(-354)-P(-i.768) 

= 638 — 039 = -599, 

The exact probability is 

p= \s_(mm\i\t+ ii(mi)V|3 i5 + i8(mDVi4ii 

— •60156. 

(2) The sex-ratio of births in Britain is approximately 51-5 males to 
48 5 females. Assuming this value to be correct, what is the probability 
that in 10,000 births occurring in a large town the number of males will 
lie between 5100 and 5200 inclusive? 

Mere N ~ 10000, p = probability of a baby being male = *515, 
q = i — /) = *485- So the number .v of males will have a binomial distri¬ 
bution with mean — Np = 5150 and standard deviation a = V(A>?) 
— 50 0. 1 he chance of obtaining a number between 5100 and 5200 

inclusive is equal to the chance of obtaining a value between 4999*5 and 
5200-5 in the corresponding normal distribution. This is equal to a 
deviation of 50 5 on either side of the mean, or i*oio times the standard 
deviation. The range of values of the standardized variable or normal 
deviate A' is therefore from —i-oio to H-i oio. The probability of 
finding a value within this range is accordingly 

P(ioio) — P(~ioio) = 835 — 165 

= 670. 

Note .—This normal approximation may fail when the product Np 
is small—say less than 5. However, it can be shown that when N is 
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large and^^ is small the probability of jc successes in N trials is approxi¬ 
mately e - this is the so-called “Poisson limit”. (sL for 

^■ample G. U. Yule and G. Kendall, Introduction to the Theory of 

14th edn., 1950, Griffin). A similar failure can occur if q is 

PROBLEMS 

(1) The death rate (per year) in a large population is found to be iic 

per thousand at a certain age. What is the chance that of 1000 randomly 

chosen individuals it will be found that between 100 and ico die in the 
lollowing year.^ 

(2) Consider 1000 children whose fathers have blood-groups 4 B 
and whose mothers have blood-group O. What is the chance ‘that 
between 500 and 550 of the children have blood-group A^} 

20.12 Two-variable discontinuous distributions 

We have considered above the distribution of a single measurement 
or observation taken on a population of animals, plants, e.xperiments or 
other objects; thus we have had the example of the distribution of heights 
of adult men in the British Isles. Now quite often we shall take more 
than one measurement: e.g. “x” might be the height, “3-” the weight 
and 2 the age of a person, so that for each person we shall have three 
measurements, one of x, one of y, and one of tr. Together they will have 
a combined” or “three-variable” distribution. 

Such distributions can be continuous or discontinuous, and it is 
also possible for one variable, such as x. to have a continuous range of 
variation, and another, such as to be discontinuous. This would be 
on^t^^^ ^■^P^^scnted the height of a plant and y the number of leaves 

We shall consider first the case of a two-variable discontinuous distri¬ 
bution. As an e.xample we might take a sample of families in a given 
town and tabulate them according to the number of sons (male children) 

X, and the number of daughters (female children) y. The obvious way 
ot summarizing such a sample would be in a table set out as on p. 598. 

ir 20=/yo families with no sons and no daughters, 

““/i 0 families with i son and no daughters, and so on; in general we 
wn denote by/,^ the number of families with x sons and y daughters, 
e total for =/oo +/01 + • • -f-/oi = ^/ov 'Jf first column is 

clearly the number of families having no sons, irrespective of the num¬ 
ber of daughters. Similarly the other column-totals fn- and f 
are equal to the numbers of families with i, 2, 3, and 4 sons respectively: 
tnese column-totals therefore give the observed distribution of the 
vari^Ie X considered on its own. In the same way the row-totals 

rZHJv ^ 32, fT2 ^ 16. /ya = 4 > /ta = 2 show the number of 

lamilies with o, i, 2, 3, and 4 daughters respectively. 'I’he sample size n. 
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or total number of families in the sample, can be obtained by adding 
either the row or the column totals: this provides a check on the 
arithmetic. 


Table 20.3—Distribution of families according to the numbers of 

sons and daughters 


y = number 
of daughters 

X = number of sons 

Z/xyX 

0 

I 


3 

$ 

4 

Total 

4 

0 

0 

1 

I 

1 

0 

2 

5 


/o 4 

/l 4 

fz\ 

fzi 

fa 

fn 


3 

I 

I 

2 

» 

1 

0 

0 

1 1 ■ 

4 

s 


/o 3 

/l 3 

1 f 

1 

f S 3 

fiS 

Its 

1 


2 

5 

5 

4 

2 

0 

1 

16 

tg 


i /02 1 

f.z 

/a 2 

f 32 

f 42 

/ra 1 

1 


I 

1 

15 

9 

! 6 

I 

1 

1 

32 

28 


/oi 

/u , 

/21 

T. 

1 

/41 

fn 

1 

0 * 

20 ' 

f 

16 

i 8 

, 1 

k 

2 

0 

46 

38 

i 

/oo ' 

/i 0 

f 2 0 1 

/30 

f 4 3 

fro 


Total ! 

41 

3 t 

21 1 

6 

I j| 

1 

i 100 

95 


/or 

f\T 1 

1 

fzT 1 

fsT 

Ar ^ 

' n 

1 



28 

1 

22 

1 

24 1 

9 : 

1 

1 

, 1 

I 

I 

84 

i 

1 



Now it is clear that f^Qln — 20/100 provides an estimate of the 
proportion of families in the general population with o sons and o 
daughters. If we take a very large sample such an estimate will be a very 
good one, and we can say that as n -> co the relative frequency /oo/«. 
tends to a limit />oo* which is the true proportion of such families. 
Strictly speaking this limit is an idealization, since the available popu¬ 
lation of families is not infinite. But it is convenient to speak in that 
and leads to no serious error in practice. In the same way the 
relative frequency will tend to the probability of a family 

having .v sons and r daughters, and the marginal relative frequency 

At, « - will tend to the probability = ^pxv that a family 

will have .v sons, irrespective of the number of daughters. 

Now from this table we can calculate a mean and variance for each 
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of the variables x and y in the usual way, by considering their distri¬ 
bution alone. 

For example the calculations for x will proceed as follows; 


1 

1 

X = number of sons . 

0 

I 

2 

3 

4 

Total 

fxT = number of fam¬ 
ilies with X sons 

41 

31 

21 

6 

1 

I 

n 

ICO 

xf j-T' * 

0 

31 

42 j 

1 

18 

1 

1 

4 

T{x) 

95 

X^T 

0 

; 3 t 

1 

; 84 

54 

16 

T{x^~) 

185 


T{x) 

T{x^) 


X = 

= 

= 


^/ztX = 95 
Zf,rx^= 185 

T{x)ln = -95 

T{x^) — xT{x) = 185 — 90-25 = 94-75 
S,J{n - i) - 9475/99 = -9571 

= •9783- 


Here 7 '(jf) means the sum of all the values of x, and 7 '(.V') the sum of all 
the values of .v is the mean of .v, and is the sum of all the squared 
deviations from the mean, £frr{x — x)-. is the estimated variance 
of x; we shall explain presently why we use a double suffix for this, and 
Sx is its square root, the estimated standard deviation. A similar calcu¬ 
lation for gives T(y) ~ 84, y -84, «’y„ — '9438, = *9615. If we 

take a very large sample we can expect x to be a close estimate of the 
true mean Hx = of x, Vxx to be nearly equal to the true variance 
and Sx nearly the true standard deviation cr,; and similarly for j. 

We can also calculate a quantity Sxv ^ fxv (x — x) Q- — ^), or 

“sum of products of deviations from the mean”, or “codeviance”. 'This 
quantity is in a way intermediate between Sxx, the sum of all squares 
of deviations {x — xy from the mean of x, or “deviance of .v”, and 
the similar sums of squares of deviations for y. On dividing Sxv by 
(n — i) we obtain = SxJ{ft — 1) which is called the (estimated) 
“covariance” between x and y, and can be compared in form with 
Vxx = SxJ{n — i), the estimated variance of .v, and = SyJ{n — 1 ), 
that of y. 

Two questions arise in connection with this: what is the meaning of 
the covariance Vxyy and how can it be calculated most expeditiously? 
For convenience we shall deal with the second question first. 
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By a direct algebraical transformation, we have 

Sxv = {x —x){y —y) 

= [xy ~xy ~xy +xy] 

==^/xv^' —xZf^y — yZf^yX^ +xyZf^y 
= ^/xy xy — X («_y) —y{nx) + xy w 
= ^fxv xy — nxy 

= ^fxv xy — T(x) y .... (20.24) 


Write T{xy) = Zf^^yXy. This is called the “crude sum of products”, 
and is the sum of all the products xy = (number of sons x number of 
daughters) for all the individual families. The term T(a;)j>’ (i.e. total for x 
X mean of y) which has to be subtracted is called the “correction for the 
mean”. 

From the entries in each column of Table 20.3 we can calculate a 


sum Z f^yy by multiplying each entry by the corresponding value of y 


and summing. For the first column, x — o, we have Zf^^y = 0x4 

+ i X3+5X2 + 15X I +20x0 
the foot of the column. Now Zxy 


= 28; this total is written at 


^[xZ/^yy"], so that if we 


multiply each of these totals by the corresponding value of and add we 
obtain the crude sum of products 


r(.\3') = Zf^y -V3; = 2Sxo-f22Xi+24X2+9X3 + iX4 

= lOI, 


and from this we find the 


Codeviance 

Covariance 


= r(.y>>) — T(.v)jv = loi — 95 X -84 == 21-20 
= Sgy‘(n — i) = 2i-2o'99 — -2141. 


This calculation can be checked by performing it in a different order, 
summing first along the rows to obtain Zf^y\\ as shown in the last col¬ 
umn of Table 20.3 (oxo-^ ox i + i X 2+1x34-0x4 = 5, 
etc.), and then multiplying each element of this column by the corres¬ 
ponding value of V and summing, to obtain Z{yZf^yx) = $ X 4 

y ^ 

"h 5 X 3 -t- 19 X 2 + 28 X I 38 X o = loi. We have a further 
check in that the simple (unweighted) total of the last row, 84, must 
equal the total T{y) of y, since 





Similarly the unweighted total of the last column, 95, must equal T(x)y 
as it does. 

What does the covariance mean? By definition it is v^v = 
^ fxv (•'■’ — A’) (>’ — ~ i)- if the divisor was n instead of (;/ — i) we 

could interpret this very' simply as the average of (a; — ;f) (j — y), the 
product of the deviations of .v and y from their respective means. The 
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- ^ ~ counteract the 

slight inaccuracy due to the use of the sample means ;c and y instead of 

the true means and exactly as in the calculation of the variance 
If « IS large it does not really matter which divisor is used. 

Now let each pair of observed values {x, y) be represented by a point 
with cartesian co-ordinates (x,y). The whole sample will then become a 
set of points m the plane, called a “scatter diagram”. (Such a diagram 
Z ^^^*-epresenting the sample pictorially. showing the general 

form pf the distribution in a way which appeals to the eye.) Let us draw 
a vertical line to correspond to the mean x-, and a horizontal line to 
correspond to j; thereby dividing the plane into four quadrants (Fig 
20.4). Now if the point {x, y) lies in the upper right-hand quadrant 



Fig. 20.4 — The sign of the product (x — x) (y — y) 


^ T positive, and therefore so will the 

product (x — x) (y ~y). Similarly a point in the lower left quadrant 
provides a positive product, while points in the upper left and lower 
right quadrants give negative values of (x-x)(v-y). Roughly 
speaking, if the distribution is mainly concentrated in the upper right 
and lower left quadrants there will be more positive products than 
negative ones, and we shall expect and the covariance to be positive 
on balance. (A striking example of tliis occurs in Table 20.4 p 604 
shown diagrammatically in Fig. 20.5, where there is a clear concentra¬ 
tion of values in the left-hand lower and right-hand upper corners and 
few values in the other two corners.) If the reverse is true the covariance 
will be negative. Thus the covariance can be considered as a measure of 
association between the two measurements x and v. For example 
suppose that a: represented the height and y the weight of each member 
ot a random sample of adults. On the whole we expect tall persons to 
DC heavy, and short persons to be light, .so that most of the points in our 
catter diagram will lie on the upper right and lower left, and the co- 

positive. Of course there will be e.vceptional cases 
01 tail thin light individuals, and of short heavy ones; but these will not 
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reverse the general trend. Any two body measurements will in general 
have a positive covariance: a large height will tend to be associated with 
large arm- and leg-lengths, large waist measurement, large body area, 
and so on. In none of these cases ^vill there be a complete relationship, 
but the covariance shows the general tendency. The covariance be¬ 
tween the income of parents and infantile mortality can be expected to 
be negative; the higher the social class, the better chance of survival. 

20.13 Correlation 

Since the covariance v^v is effectively the average product of devi¬ 
ations of X and y from their means, it will be measured in the same units 
as the product xy. Thus if.x represents the height of a person measured 
in metres, and_y the weight in kilograms, the covariance will be in kilo¬ 
gram-metres. If we change the unit of height to i centimetre, the 
measurement .v will be multiplied by 100, and so also will be the co- 
variance. On the other hand the covariance is independent of the posi¬ 
tion of the origin of measurement, since it is only concerned with 
deviations from the mean. If x was a temperature measurement, it 
would not matter whether it was expressed on the Centigrade or 
Absolute scale; we should obtain the same covariance. The general rule 
can be expressed as follows: let X be the measurement of a quantity on 
one scale, and x = A BX its value on another scale, with possibly 
different origin and possibly different unit of measurement. Similarly 
let Y and y = A' + -B' T be the measurements of a second quantity on 
two different scales. Then 

= BB' vxY . . . (20,25) 

Now it is useful to have a measure of association which is indepen¬ 
dent of the units of measurement chosen. We can obtain this by dividing 
the covariance by the product of the standard deviations r* and Sy 
of X and y respectively. The quotient is Karl Pearson’s “product- 
moment correlation coefficient” usually referred to simply as the 
“correlation” r, 

r . . . (20.26) 

For example in the case of numbers of sons and daughters in families, as 
shown in 7 ’able 20.3, we have 

r = •2I4i/(-9783 x -9615) — -2276 

The usefulness of this coefficient is that it enables us to compare 
correlations between quantities of entirely different kinds. The cor¬ 
relation between the heights of father and son is about -5, the correlation 
between arni-lengths has about the same value, and so has the cor¬ 
relation between the counts of ridges on finger-prints. As we shall show 
later r can take any value between — i and i, and from its definition it 
has the same sign as the covariance. A correlation of ± i can only occur 
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if there is an exact linear relationship benveen x and y, i.e. if all the 
plotted points (at, y) lie exactly on a straight line y = a ^x. The 
smaller the degree of relationship, the smaller the correlation coefficient. 
When X and y are independent, r = o ; that is, when the value of x has 
no effect, direct or indirect, on that of y. Or, to speak more precisely, 
this will hold in a very large sample; in small samples r may be appreci¬ 
ably different from zero owing to random sampling fluctuations. We 
can establish this theorem, that r o for independent variables, as 
follows. 

Suppose that .v and y are independent variables, and that the probab¬ 
ility of obtaining a value x is and that of obtaining a value y is p^'. 
Then the chance of obtaining the pair of values (.v, y) is pxPJ■> by the 
multiplication law for independent probabilities. If w, the sample 
number, is large, the number/*v of cases in which we obtain the pair of 
values (x, y) will be approximately «/>*v = nPxPy . The mean x will 
approximate to its true or limiting value = T pgX^ so that 

Zpx (jc — ^pxX — pxhx = — ^f^x = o. 

The covariance Vx„ = T(v — .v) (y — i) "ill approximate to 

its true value 

= 2: npxpy (x — /lx) (y — Mv)/(« — 0 

** y 

= ” (-^ “ f^x)^Pv iy — /^y)/(» — l) = o 

The value of the correlation coefficient obtained in a very large 
sample, or, more strictly, its limiting value as u -> 00, is called the “true 
correlation” p = pj.^. It is obtained by dividing the true covariance 
by the product of the true standard deviations ax a^. I lerc, where .v and 
y are independent, we have shown that Vxy = o, and therefore pxy = o. 
Thus one would expect little or no correlation between such qualities 
as hair colour and income group which presumably are completely un¬ 
related. 

Note. —Our use of p for the true correlation and r for the sample 
value agrees with the general convention regarding the use of Greek and 
Latin letters. Rut the reader is warned that many authors use p for the 
sample value, as well as for the tiuc value. 

20.14 Two-variable continuous distributions 

Above we have mentioned correlations between such variables as 
height and weight. Strictly speaking we have not yet delined such 
correlations, since these have a continuous and not a discontinuous dis¬ 
tribution. However it is easy to complete the definition, using the same 
transition from discontinuity to continuity as for a one-variable distri¬ 
bution. 
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A sample of size n will consist of n pairs of values (arj, 

(Xn, jVfi). (These pairs can be represented by points in a scatter diagram.) 
The total T(x) for x will be £xa — + iCg + . . . -f- ^n> the mean x = 

T{x)ln, the sum of squares of deviations, or deviance, of x will be 
Sxa, = {xa — xy = Sxa^ — xT{x)j the estimated variance of x will be 
^xxK ^the codeviance will be S^y = 2 ^ {xa — x) {ya ~y) = 
2 }Xaya — xT(y)y and the estimated covariance Vgy = 5 *v/(« — i), and 
all the other definitions will proceed as before. 

If the sample size is large, it is best to group the distribution, and 
treat it as a discontinuous one. An example is provided by the following 
table; 


Table 20.4—Distribution of percentage overcrowding (x) and 
infant mortality (y) in 29 London boroughs in xqzi 


Infant 


X 

fxTX 

/rr.V* 

^fxyY 


Percentage overcrowding x 


3 

3 

9 

2 


—2 
-8 
16 
-10 


— I 
-II 

11 
— 2 


o 

o 

o 

I 


I 

4 

4 

4 
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6 

12 

— I 


3 

6 

18 

5 


4 

8 

32 

7 


2 

102 

2 


illUi USUi^ 

y 

0— 

4-9 

5 - 

9.9 

10- 

149 

15- 

19-9 

20- 

249 

25- 1 30- 
29 - 9 | 34 9 

35- 

39*9 

T't’l 

y 

fTyV 

fry y* 

SfxyX 

160-9 

— 

4 



— 



4 

1 

I 

4 

% 

4 

4 

4 

150-9 


— 



2 

— 

1 

1 

4 

3 

12 

36 

9 

140-9 

1 


2 

“ 


I 

I 


4 

2 

8 

16 

3 

130-9 

— ■ 


I 

I 


«— 

— 


2 

2 

2 

2 

— 2 

120-9 

1 

I 

4 

I 

I 




7 

0 


0 

-5 

1X0-9 

1 


I 

' 


I 

— 


2 

— X 


2 

2 

100-9 

' I 

1 

1 

3 


I 

I 



7 

—2 


28 

-5 

90-9 

— 

I 




— 

— 

— 

X 

—3 


9 

—2 

80-9 


1 

— 

— 




• - 

0 

—4 


0 

0 

70-9 


1 ^ ' 







I 

—S 


25 

—2 

Total 

I 

1 

4 ! 

11 

2 

1 

4 

3 

2 

2 

29 


2 

122 

2 


T(X) =2 X = 2/29 = -0690 

^(y’) = 2 F = 2/29 = -0690 

t{x~) = 102 Sxx = 102 — 2 X 0690 = 101-862 

T(y-) — 122 = 122 — 2 X 0690 == 121-862 

T{A'y)=(—3 )x(-2)H-(-2)x(— io)+(—i)x(—2)+ . . . +4x7 

= 73^4X4+3X9+ • - • +(-5)x(-2)(check) 

SxY = 73 — 2 X 0690 = 72-8620 

^A'A' = 101-862/28 — 1/12 (Sheppard’s correction) = 3-56 

Vyy “ 121-862/28 — 1/12 — 4-27 

“^XY = 72-862/28 = 2-60 

^A' = \+’A'A' =1-89 

= V^’yy = 2-07 
rxY = '^^xyIsxSy = -66. 
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In the table the infant mortality is grouped in intervals of 10, i.e. 
from 160 to 169, from 150 to i 59 > and so on. Since the value is given to 
the nearest integer, the group from 160 to 169 really means from 159 5 
to 169-5, with centre point 164-5. convenience of calculation these 
values are coded, i.e. we take a second variable V taking the values 4, 3, 
2, 1 ... for the various groups, and therefore related toy by the equation 

y = 124-5 + 

The percentage overcrowding is defined as the proportion of dwellings 
with an average of two or more occupants per room. This again is 
coded according to the formula at = 17-45 + 5 The means, vari¬ 
ances, and covariances of X and Y are calculated in exactly the same 
way as for a discontinuous distribution, except that a Sheppard’s cor¬ 
rection equal to iV is subtracted from the two (coded) variances ^-a^v and 
Vyy (but not from the covariance). These values for the “coded” vari¬ 
ables X and Y can be readily translated into the original variables x and 
y by using formulas (20.19) and (20.25). 

X = 17-45 + 5^ = >7-80 

y = 124-5 + 10^ = 12519 

= 5^ = 89-0 

Vxv= 5 X tOVxy = 130 

^vv ~ lO^Vyy =427 

= 5 ^a' = 945 

= 10 Sy = 20-7 

The value of the correlation coefficient is the same for .v, y as for 
X, Y, since it is independent of origin and scale of measurement. 

By analogy with the histogram of a one-variable distribution, we 
can represent the two-variable distribution of 'I'able 20.4 by a “stereo¬ 
gram”, as shown in Fig. 20.5. Here a pillar is placed on each cell of the 
table: the volume of the pillar rcpre.sents the observed number of cases 
in the cell. (In this case all the pillars have equal bases, and so the 
volume is proportional to the height.) This figure also shows the histo¬ 
grams of the two marginal distributions of x andj alone. 'I'lie correlation 
between x and y is vividly shown by the “range of peaks” running 
diagonally across the figure. 

In a small sample such a stereogram is necessarily irregular. But in 
most distributions if we take a very large sample and a narrow interval 
of grouping the upper surface of the stereogram will tend to a smooth 
surface z <^(.v, y), where z represents the vertical height above the 
(a, y) plane. <^(.v, _y) is the “probability density” or “distribution func¬ 
tion” (see p. 584), and has the interpretation that the chance of obtaining 
a value of x between a, and x^ 4- Bx, together with a value ofy between 
y, andy, 4- By, is approximately f{x,y) Bx By, provided that's.v and By 
arc small. The chance of obtaining a value of (x, y) somewhere in a 

given range R can be obtained by integration: it is f f 4 i{x,y)dx dv 
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Fig. 20.5 —The distribution of infant mortality and overcrowding in London 
boroughs in 1911, represented by a stereogram, with marginal histograms 


(see Section 16.13). By arguments similar to those in the one-variable 
case, the true mean of .v can be shown to be J J x ^(.v, y) dx dy, 

the true variance of x to be 

I'x* // {x “ Mr)- y) dx dy 

^ f I •>') ~ 

and the covariance 

= j J (x — ^x) {y — Mv) y) dx dy 

^ f I '“^y y^ ^y ~~ 


All these integrals are supposed to be taken over the whole range of 
values of .v and y. If we know the form of ^(.v, y) these formulas enable 
us to compute the true means, variances, covariances, and correlations. 
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But as a rule we have to be content with the sample variates as estimates 
of the true values. 

20.15 Two-variable normal distributions 

In many distributions the stereogram of a large sample approximates 
to a surface of the form shown in Fig. 20.6, with a single peak and a 


// 







. s. 


\ V 
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rapid fall on all sides. As in the one-variable case we can imagine an 
ideal form of distribution, often closely approximated to but never com¬ 
pletely realized in practice. This form is the two-variable “normal” or 
“Gaussian” distribution, defined by the equations 


z 

where <o 
and Q 


= <ff{Xyy) = (27r)-^a>“* 

= (x — (:r — /x^) {y — fx^,) + 

(20.27) 


Here Is the true mean of Xy fiy that of j, u^z, Vvv ^xv are the true 
variances and covariance, c* and Oy the true standard deviations, and /> 
is the correlation. That this is so can be verified by direct integration, 
using the formulas of the preceding section. But as such integration is 



Stature of Father: inches 


Fig. 20.7 —Contour lines for the observed distribution of heights of fathers and 
sons in a sample of X078 sons, and the corresponding contours for a normal 

distribution 
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rather complicated (though not really difficult) the details will not be 
given here. 

We shall also omit the rather complicated considerations which lead 
to the formula; it seems enough to state that such a distribution will 
occur when the variations in x and y are both due to a number of small 
causes, none of which produces any very large elfect when acting on its 
own. If some of these causes affect both .v and j there will be a correla¬ 
tion. Thus a classical example is the correlation between the heights of 
father and son. We can plausibly assume that height is affected by a 
number of different genes, and probably also by various environmental 
factors. Now father and son will share some of these genes and other 
factors, but not all. In Fig. 20.7 the contour lines are plotted for the 
distribution surface of a sample of 1078 pairs of fathers and sons, 
together with the theoretical contour lines of a normal distribution. 
Considering that the inevitable fluctuations due to sampling are bound 
to produce irregularities it is evident that there is very good agreement. 


20. x 6 Interpretation of correlation 

We have shown above that there was in 1911 a correlation of -66 
between infant mortality and percentage o\ercrt)wding in London. 
What does this mean? 

A priori there are five possible interpretations. 


(<2) Overcrowding causes an increase in child mortality. 

(6) Child mortality causes an increase in overcrowding. 

(c) There are common causes which increase both overcrowding 
and child mortality— for e.xample, general poverty. 

{(I) '^I'he apparent correlation is purely accidental: it simply happened 
that there was in 191 i high mortality in the overcrowded boroughs, and 
if we took another year we slunild get a ditierent elfect. 

(e) 'Fhere is some way in which we have chosen the figures which 
causes an apparent or spurious correlation, whereas there is no real con¬ 
nection. For example, the tendency in some parts of London to live in 
flats, and in other parts to live in houses, might affect the distrilnirion, 
so that the apparent correlation of mortality with overcrowding might 
really be a correlation w ith type of dwelling. 


None of these explanations can be completely ruled out. Hut there 
are statistical “tests of significance”, to be discussed later, which greatly 
reduce the chance of being misled by purely accidental correlations, as 
in explanation {d). Apart from that, only common sense and a proper 
grasp of the qtje^tion at issue can be used to decide between the various 
possibilities. I lore common sense suggests that {b) is absurtl ami (c) un¬ 
likely. In a well-coiulucled invcstigatic>n case (e) shouhl not occur; that 
is, no spurious effects should be produced liy the way the material is 
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selected, or by any cause other than the one which is being investigated. 
Unfortunately this is usually easier said than done. 

In certain cases statistical techniques can be used to reduce the 
possibility of a misleading experimental result. For a more detailed dis¬ 
cussion the reader is referred to textbooks on this subject, such as R. A. 
Fisher’s classical treatise, The Design of Experiments (Oliver and Boyd). 

20.17 Mean and variance of a sum 

In Table 20.3 we have a (hypothetical) sample of families classified 
according to numbers x of sons and y of daughters. We could also classify 
these according to the total number of children t = x +y (Table 20.5). 


Table 20.5—Distribution of total number of children 


Total children, / .. 

1 

0 

1 

I 

' 

2 

1 

1 

3 

4 

5 

6 

7 

No. of families ft .. 

20 

31 

22 

1 

1 

14 

6 

5 

I 

I 

1 


These numbers are obtained by summing diagonally. Thus the families 
containing 2 children can have 2 boys, o girls, or i boy, i girl, or o boys, 
2 girls; their total number is accordingly/20 +/n + /oa* 

From this distribution we can obtain a mean and variance of t in 
the usual way. But a quicker method is to use the following formulas 

i=z X ~^y 

’^'tt = . * • (20.28) 

i.e. ? = *95 + 84 = 179, t = *9571 + 2 X -2141 + *9438 = 2-3291. 
I’he equation for the means is derived from the one for the totals 

T{t) = T{x) + r (30 . . . (20.29) 

by dividing through by the sample number n. Equation (20.29) nierely 
states that the total number of children T{t) is the sum of the total num¬ 
bers of sons ^(x) and of daughters T(y') respectiveljs and scarcely needs 
formal proof. The equation for the variance of t is obtained by dividing 
through by {ti — i) the corresponding equation for deviances: 

Stt = *5x1 + Sy^ . . (20.30) 

This equation is proved as follows. If f = at + j is the total children 
for any one particular family, 

(i — = [(jf -h^*) — (;v + j)]" (by 20.28, first equation) 

= [(x — x) + (y — y)Y 

= (x — Xf + 2{x ~x)(y ~y) -{.(y —yy 
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Since Sft is by definition the sum of all the squared deviations (/ — f)* 
for all the families, and and are similarly defined, equation 

(20.30) follows by summation. 

In the same way we can consider the variable u = x — y and find 
its distribution (Table 20.6). The corresponding formulas for tt are 


Table 20.6—Distribution of u 


Value of M 

1 

-3 

1 

1 

^ —2 

( 

1 

— I 

0 ^ 

1 

2 

3 

No. of families . . 

1 

I 

• 

1 

! 7 

23 

' 33 

1 

24 

1 9 

' 3 


easily shown to be 

T{u) = T{x) - T{y) 

^UU ^XX ^^xv “h *^vv • • (^^*3 0 

whence by division by n and {n — 1) respectively we obtain 

u = X — y = *95 — 84 = • 11 

-f- Vyy -- -9571 — 2 X -2141 + -9438 = 1-4727. 

Alternatively we can use these equations in the reverse direction. 
Given the distribution of x and y in 'I'able 20.3, we find those of .v, y, 
t = X y and u = x — y separately, as in 'Fables 20.5 and 20.6. From 
the distributions of x and y we find ^(.v), 7 '( v), and S^y in the usual 
way. The value of T{t) obtained from the distribution of / gives us a 
check on T{x) and T{y), by (20.29), value of iS*,, enables us to 

calculate the codeviance Sj.^ from (20.30). Fquations (20.31) provide 
a further check when 7 ’(w), have been found from the distribution 
of u. From 7 '(.v), T{y), and Syy wc then find the means, 

variances, covariance, standard deviations and correlation as in Sections 
20.12 and 20.13. This provides an alternative method of calculating 
Sxy, and many computers prefer it to that explained in Section 20.12, 
as it is more expeditious and has more checks. 

More generally we can show that i( w — Ax By, where A and B 
are constants, then the mean and variance of tv are given by 

te = Ax d- By 

z\,y. = A'^Vy.^ + zABv^y 1- BH\y . (20.32) 

Similar relations will hold for the true mean and variance of w, which are 
no more than the limits of the sample mean and variance as the sample 
size tends to infinity. So if — Ax + By 

V,, - 1 - 2AB -{ B^ Vyy . ( 20 . 33 ) 
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20.18 Range of values of a correlation 

^>et w = Ax + as in the preceding section. Then .Suiy, is the 
sum of squares of deviations from the mean, by definition, and is there¬ 
fore never negative, since no square (of a real number) is ever negative. 
So > o. Since — Sy^y by definition, 

it follows from (20.32) that 

A-sJ- + 2.ABrxyS,Sy + > o 

whatever values we choose for A and B. Let us therefore choose A to 
be Sy and B to be —r^yS^ (so that zo = SyX — r^-yS^y). We then obtain 

i.e. ^*V(i —?**«“) >0 

But Sx“ and Sy- are necessarily positive, and it accordingly follows that 
I _ rxy‘ > o, i.e. Txy^ < I, i.e. r^y lies between —i and +1. Further¬ 
more it can only actually take the value — i or +i if the corresponding 
variance of to is exactly zero, i.e. if w = Ax + By is constant. But 
+ By = constant is the equation of a straight line. Thus Zxy can 
only be ±i when there is a perfect linear relationship between .v and y. 

20.19 Many-variable distributions 

We can have distributions of more than two variables. For example 
we could measure the numbers x, y, z and tv of members of a family 
Having blood groups O, Ay B and AB respectively; if we do this for 
many families we shall find fxytw (say) families of given constitution 
out of n selected. When w becomes large the relative proportion/xysu./// 
can be imagined as tending to pxvzw> the true probability that a family 
chosen at random will contain x O’s, y .^’s, z B's, and tv AB's. This is 
an example of a discontinuous distribution. If we measure the height Xy 
weight v and age a: of all members of a population we shall have a three- 
variable continuous distribution. In this case we can imagine that there 
is a probability 2^ v,, ~,) 8.v 8y Bz of obtaining a person with 

height between .v, and .v, + 5 .v,, weight between y, andjx -h By^, and 
age between .3:1 and Mere, again, <;6(.Vi, yi, Zi) is known as the 

“distribution function” or “probability density”.' 

'I’here is no convenient graphical representation when there are 
more than two variables. However by considering each of the variables 
.V, V, z separately we can find the means x, y, . 5 , . . .; it is convenient to 
look upon this set of means as a vector 

m = [.V, y, .s, . . .] (see Section 18.7) 

We can also find the variances v^xf • • . respectively, of each of 

the variables; and by considering the distributions of pairs of variables 
we obtain the covariances Vxy, t’xc, . . . and correlations ... It is 
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convenient to imagine these set out as matrices (which must be sym¬ 
metrical, since v^y — Vy^, r^y = etc). 

ryjT I ry, 

rzx r.y I 

These are known as the “covariance matrix” (or “variance matrix”) and 
“correlation matrix” respectively. When the sample number « is large 
the observed mean x will approximate to the true mean the observed 
variances and covariances to their true values v^.^, v^y, etc., so we can 
speak of a true mean vector (x = and a true covariance 

matrix u, and similarly of a true correlation matrix p. I'he inverse 
i= v~^ of the variance matrix is known as the “invariance matrix”; 
there will be a corresponding “true invariance matrix i”. 

If / = Ax -b By 4 - Cz -f- ... is any weighted combination of the 
variables .v, y, z . . . the sample mean and variance of t will be 

i — Ax + By + C.V 4 . . . . 

Vi, — AH'xx 4 B'^Uyy 4 4 ... 4 zABr^y 4 . ■ - (20.34) 

respectively, and the corresponding true means and variances 

== = Afi^ 4 Bfiy -1- CfL. 4 . . . 

v,t — A-u^j. 4 B'vyy 4 4 . . . -f zABv^y 4 ' . . . (20.35) 

By writing the set of weights A, B, ... as a row vector A = [A, B, C . . 
these formulas can he neatly summarized in matrix notation 

i = Am', n, = All', V,, AvA', v,, = A\jA' (20.36) 

A specially important case occurs when .v, y, z, are in(ie[>en<lent vari¬ 
ables, so that all the true covariances v^y, etc. arc zero (Section 20.13). 
If A, B, C, etc. arc then all equal to i, so that / = a- 4 J 4 ~ 4 ... is 
the ordinary unweighted sum, we have by (20.35) 

^ + ^yv 4 t'se 4 . . . . . (20.37) 

The true variance of a sum of independent variables is the sum of their 
variances. ('I'his will not be necessarily exactly true for the sample 
variances, owing to sampling fluctuations). 

20,20 The multinomial distribution 

Suppose that two persons of blood-groups MN marry. 'I'hen by 
Mendel’s laws (Section 19.6) we know that on the average } of their 
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children will be MM, \ will be MN, and J NN. But if they have four 
children we shall not necessarily expect exactly one to be MM, two MN, 
and one NN. Any combination can result, and if we consider a large 
number of families we shall obtain a distribution of numbers of the 
three types of children. This is known as a “multinomial distribution”. 

In general, suppose that an event a occurs with probability p, an 
event ^ with probability q, one y with probability r, and so on. Then 
we know that in ?i successive events we have a chance 

of obtaining exactly -v events a, y events / 3 , z events y, . .. By repeating 
the whole set of w trials many times we shall obtain a joint distribution 
of the numbers .v, y, z, . . . 

We can readily calculate the true means, variances, and covariances 
of X, y, and z in this distribution. This has already been done for the 
binomial, which is the case in which we have only two classes of events 
(Section 20.6). W’e can avoid complicated recalculations by the simple 
trick of combining classes together. Thus suppose we imagine all classes 
y, S . . . other than a are lumped together into a single class. The 
distribution is thereby reduced to a binomial; the probability of any 
single event being in class a is /), and of being in the remaining com¬ 
bined class is I — p. The number .v in class a accordingly has mean and 
variance respectively 

= ==«/>( I “/>) • ■ (20.38) 

Similarly = nq, = nq{i — q), and 'these formulas must apply 
equally to the original distribution. Now suppose that we lump 
together classes a and ^ into one, with probability (p + q) and actual 
observed number t = x + y, the remaining classes are also combined. 
The true variance of / is therefore = n{p + ^)(i — p ~ q). But by 

( 20 - 33 ) 

-b 2 V^y + or 

ti{p (7) (i - p —q) = np {i —P) 2v^y H- nq{l — q). 

On solving this equation we find that the covariance is 

t'xv = —fipq - - - (20.39) 

It is often convenient to consider the obser\'ed proportions, say P = x/n, 
Q ~ ^ • • • instead of the observed numbers: these propor¬ 

tions are estimates of the true probabilities p, q, r, . . . of falling into 
their respective classes. From equations (20.20), (20.25), h^ve 

fip = py Vpp p{i ^ p)ln, vpQ=—pqln . (20.40) 

We can also fiiid the mean and variance of any weighted combination 
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w =5 AP + PQ + CR + ... of these observed proportions, from 
equation (20.33). 

= Ap -\-Bq -\-Ct -f- , , , 

^tcw = • • • • — zABpq—zACpr — . . .]/« 

= A^p-\-B^q-\- . . . — A^p^ — B^q ‘— ... — zABpq — zACpr — ..]/« 
= A^p-\-B^q-\- . . . — {Ap-\-Bq+Cr+ . . .)^]/« 

= \A^p-{-B^q-\- . . . —fxj'ijn . . . (20.41) 
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SIMPLE STATISTICAL PROCEDURES 


21.1 Estimation 

An estimate is a guess at the value of a quantity which is not exactly 
known. Suppose we find, on examining n — 400 men chosen at ran¬ 
dom, that X = 32 of them are colour-blind. What is the true frequency 
p of colour-blind men in the general male population? We cannot tell: 
but it is reasonable to take the observed proportion P =■ x/n = 32/400 
= -oS, or 8 per cent as an estimate. But we shall also wish to know how 
near this is to the true value, i.e. what error we can expect. 

Suppose we repeated the procedure manv times over, each time 
selecting 400 men and finding the proportion P of colour-blind indi¬ 
viduals. We know that .v, the actual number found, will vary from 
sample to sample; it will in fact have a binomial distribution with mean 
fip and variance iip{i —/>), or standard deviation \ [///>(i —/>)]. We 
also know that this binomial distribution is nearly normal, or Gaussian. 
Thus P ~ .V n will also vary from sample to sample according to a nor¬ 
mal dislrilnition, with mean /> and standard deviation <7 — —/>)/”]• 

From the taldes of the normal integral in the Appendix it can be seen 
that a normal variable ditfers from its true mean by less than 1-96 a in 
every 19 times out of 20. 'This is illustrated in Fig. 21.i; the shaded 



It' .V, the i\k \ iaiidti troni the moan, is equal to i 960, the sum of the shaded areas is 05. 


area represents the chance of a deviation of more than -V = 1-96 a from 
the nieaiu and constitutes only ,;'.,th part of the total area under the 
curve. In our case we can say that in 19 samples out of 20 the true 
value p will dilfer from the observed value P by not more than 1-96 X 
\ [/'(^ ■ P) gives a measure of the error we can expect. We 

slili do not know the value of \ [/)(i — /’),'/]• unfortunately, since p is 
u..known. But in most practical cases it is good enough to sav that this 
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is approximately equal to since the observed value P 

will not be very far from the true value p. (This argument is admittedly 
rather crude, and it is possible to improve on it; but the improved ver¬ 
sions are rather more complicated and will not be discussed here.) Thus 
we can say that the true value p will not differ from the obser\'ed value P 
by more than 1-96 \/[P{i — ^)/w] in 19 out of every 20 samples. In our 
case this is 1*96 \/[-o8 x *92/400] = *027, i.e. the true proportion of 
colour-blind men probably lies between *08 — 027 = 053 and 
■08 -h 027 = 107. 

In general, suppose that tj is some unknown quantity or “parameter’’ 
connected with a distribution. Suppose further that a sample is provided 
of n objects drawn from this distribution. Then it is generally possible 
to calculate from this sample some estimate h of the unknown para¬ 
meter 77. Now by repeating the calculations on a number of successive 
samples of « we shall in general find a different value of /i for each sam¬ 
ple. However it is shown in books on the theory^ of statistics that when 
the sample number n is large, h can usually be expected to have a nearly 
normal distribution, with mean value nearly equal to the true value 77, 
and with a certain suindard deviation which is usuallv called the 
standard error of h. It follows that in 19 samples out of 20 the observed 
value h will not differ from the true value 77 by more than 1*96 times the 
standard error, or nearly enough, by more than 2a,,. 'I'hus 77 can be 
expected to lie in the range from h 2 ^,, to h 2a,,. 'rherc is of 
course no special magic in the proportion “ 19 times out of 20’’; we can 
equally well deduce from tables of the normal distribution that in 99 
cases out of 100 77 will lie between h - 2-5877,, and // 4- 2-58r7ft, or in 999 
cases out of 1000 between // — 3-2977,, and h 4- 3-29<7,,. but 19/20 is a 
rather convenient proportion to take. Notice too that this rule only 
applies, strictly speaking, to large samples. 'The theory of estimatioji 
for small samples is very much more complicated, and will not be con¬ 
sidered here. 

Consider as an example the problem of estimating the mean of a 
distribution of a single variable .v. Let the true mean be /t; the obvious 
estimate is x = (v, 4 - x.. 4 - . . . 4 - -v,,)///. Suppose furtlter that the true 
variance of the distribution of .v is f ^ 77^; so the true mean of .v.'« is /t «, 
and its true variance is vjn'^ = by formula (20.20). Now .v can be 

regarded as the sum of n independently chosen quantities a*, //, .Vg///, . . . 
A^n/w. In repeated sampling it will have a certain distribution, by 
(20.35), (taking A -- B - - — i), its true mean will be the sum of 

the true means of .v,/«, x Jn, .... x,Jn, i.e. ft n 4 4 - . . • 4 - t‘- 'n 

n(filn) = fx. 'I'he true mean of the sample mean, in repeated samplinj’, is equal 
to the true mean of the distribution. Furthermore by (20.37) the variance 
of X will be the sum of the variances of .vfn, x.Jn, . . . xjn, i.e. o-jn- 4 - 
-4 . . . 4- rx2///2 no^jn'^ -- a~ln. 'Fhe standard error of x will 

therefore be 


v' (^^ 7 ") = '^/V" 


(21.i) 
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The Standard error of the sample mean is equal to the standard deviation of 
the original distribution divided by the square root of the sample number. 
In practice we shall not usually ^ow the true value of C7, and will have 
to use the estimated deviation s in its place. 

EXAMPLES 

(1) In a sample of 6293 babies (discussed in Section 20.8) we found 
a mefin gestation time of 281 *53 days, with standard deviation s = 15*06 
days. Within what limits can the true mean gestation time be expected 
to lie? 

The standard error of the mean is which we estimate as sjy/n 

— 15*06/^/6293 = *19 days. The true mean probably lies therefore 
benveen 281*53 — 2 X *19 = 281*15 and 281*53 + 2 X *19 = 281*91 
days. 

This example emphasizes the cautions in the interpretation of statis¬ 
tical results already noted in our discussion on correlation (Section 
(20.16). In the present example we have reduced the statistical or ran¬ 
dom error in the determination of mean gestation time to rather less than 
one day. But it can scarcely be claimed that the original estimates of 
gestation time for the individual babies had such an accuracy. Thus we 
have only determined the mean of the presumed gestation time within 
fairly close limits. This may still differ appreciably from the mean of 
the true gestation time. In short, statistical methods cannot cure defects 
(not always avoidable) in data. 

(2) This formula for the variance of a mean enables us to give a 
further justification for the rule “to find the estimate v of the variance, 
divide the deviance S^x by (n — i)’\ For if x is any variable quantity, 
with true mean fx and variance v, equation (20.6) can be rewritten as 

V = T p^ (X^ — 2XfJ, + ix^) 

= ^ PxX^ —2fx£p^v fx^zp^ 

— ^ X^ ~ 2ix . fi fx^ . I 

= t 

or L .V® = V, 

and therefore since x has mean fx and variance v/riy 

^ X“ = fx^ v/n. 

Now the deviance Sxx is defined as 

Srx = + .Vn® -f- . . . -j- .v„2 _ „x^ 

where .Vj, .Vg, • . . v,, are the sample values. Now each of the terms x^^, 
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^2*. • . • Xj? is on the average equal to /i* -{- u, while nx- is on the average 
-f- v/n) ~ fifj.- -f V. The average value of is therefore 

^ Sxx = + w) + + u) + • . • + + I') 

= 7l(fL^ + + u) =; (n — l)u. 

On the average, therefore, the deviance Sxz is equal to (n — i) times 
the true variance v. It therefore seems reasonable to divide Sgx by 
{n — i) to obtain an estimate of v. 

It is usual to write an estimate with its standard error in the form 
281’53 it *19 days. A little care is needed, because in older books and 
papers the “probable error” (P.E.) is used instead. This is about two- 
thirds of the standard error. (More accurately, P.E. = *6745 S.E.) 

Thus the estimate would be written as 281-53 ±12 days in the older 
form, where *12 is the probable error. However this use of the P.E. 
seems to have little to recommend it, and is rapidly being superseded by 
the standard error. 

The following table shows the most important standard errors of 
estimates. 


Table 21.1—Standard errors 


True value 

Sample estimate 

Estimated standard error 

Proportion, p 

P = xjti 

V[P{1 - 

Mean, /Xj... 

X 

SriVn 

Variance, 

Vxx = Sx^ 

Vxx V2/ \/n 

Standard deviation, . . 

St 

Sx!v'i2ri) 

Covariance, Vxy . . . .' 

Vxy 

Vl'^'xr^'yy -\-Vxy-]/\'n 

Correlation, p^y . . 

1 

' ''xv 

t 

1 

(1 - rxy^)l\'Ti 

1 


EXAMPLE 

(3) We found the standard deviation of gestation time to be 15-06 
days. How accurate is this estimate? 

Its standard error is SxlV{2n) — 15-06/\/12586 = '14. Thus the 
true standard deviation probably lies betwecMi 15 06 — 2 x 14 = 
14-78 and 15-06 q- 2 X -14 = 15 34 days. (Again this is the standard 
deviation of the presumed rather than the true gestation time.) 

Actually the last four standard errors given in the table are based on 
the assumption that the original distribution of x is normal. If it is not, 
they will strictly speaking cease to apply. But most distributions en¬ 
countered in practice are nearly eitough normal for the formulas to be 
reasonably accurate. In addition they are only true for large samples. 
The formulas for the standard errors of the mean x and standard 
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deviation Sx are applicable to reasonably small samples without serious 
danger. Fisher has invented a special device for dealing with the 
correlation r in small samples. Instead of r consider the quantity z = 
tanh-*r; this has true value C = tanh-^ p and standard error approxi¬ 
mately i/V(^ — 3)- 


EXAMPLE 

(4) R. A. Fisher and E, Anderson found a correlation -32 between 
the width and length of a petal in a sample of 50 flowers of the species 
Iris virginica (A?ifi. Eugen. Lond.y 7 (1936), 186). Between what limits 
does the true correlation p probably lie? 

We have xr = tanh”^ *32 = 33, with standard error i/\/(5o — 3) = 
1/V47 — '146. The true value ^ = tanh“^ p therefore probably lies 
between ’33 — 2 X -146 = *04, and *33 + 2 X -146 — -62, i.e. p lies 
between tanh -04 = ’04 and tanh -62 = •55. 


21.2 Significance tests 

An important part of scientific experimentation is the comparison 
of theory and observation, with the object of verifying or disproving the 
theory. In physical sciences it is usually possible to make such a com¬ 
parison with great precision, but in biology there is often a special 
difficulty introduced by random fluctuations. Suppose for example that 
a geneticist believes that a certain character, say that of yellowness in 
peas, is due to a recessive gene _v. He grows 100 peas from a mating of 
two hvbrids, presumed to be both Yy. If his theory is correct he 
should obtain on the average 25 per cent yellow peas, and 75 per cent 
green; i.e. he will expect to find 25 yellow peas out of his 100. If he 
finds 22 or 28 this rs clearly a reasonable agreement: if he finds 
only 2, it is clearly unreasonable, and the theory can be regarded as dis- 
provcil (or alternatively his technique is bad). Where is he to draw the 
dividing line? This is an arbitrary division, for however few or however 
many yellow peas he got it would still be possible to regard the result as 
due to chance variation. But there are some results which, though 
thcoreticallv possible, are so unlikely that their occurrence amounts to a 
dispn>of of the theory; such results are called “significant”. 

We can obtain a reasonable division between “significant” and “non¬ 
significant” results in the following way. If the true proportion of 
yellow peas is /) — ], and if we perform the experiment of growing 100 
peas many times over, we shall find that the observed proportion P will 
be distributed nearly normally with mean | and standard error 

ap - \ [/)(i — p).‘n] y (-1 • 4 too) -0433. In i9samples out of 20, 
therefore, the observed proportion P will not differ from the true pro¬ 
portion /> by more than 2ap — ’oSy; i.e. P will lie between -163 and *337. 
If we find a value of P outside these limits, say P — -lo, this is clearly 
suspicious. Such a value will only occur once in every 20 samples if the 
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theory is true. It is said to be significant at the 1/20 or 5 per cent level. 
If P differs from p by more than 2 58 times the standard error it is said 
to be significant at the i per cent level; for such a value can only occur 
by chance once in every hundred samples. If the deviation is more than 
3-29 times the standard error this indicates significance at the 01 per 
cent level. In general one can say that a 5 per cent significance is a 
warning that the theory is probably untrue, a 1 per cent significance is 
quite a good indication of this, and a -i per cent significance is virtually 
a disproof—always assuming that the experimental conditions are good. 
However it is always dangerous to take any single experiment as con¬ 
clusive evidence; it is the possibility of repeating the experiment which 
brings conviction, and this conviction is strengthened if the conclusion 
can be supported by independent evidence. 


21.3 Standard tests 

Suppose that a reasonable hypothesis states that the true value of a 
certain quantity, or “parameter” to use the technical term, is -q. Sup¬ 
pose further that we take a large sample, and find some estimate h of q, 
with standard error cx/,. 'rhen if h differs from q by more than 2a/,, we 
say it is “significant at the 5 per cent level”. If the deviation is more 
than 2-58a/,, it is significant at the i per cent level, and if niore than 
3‘29a/,, at the o-i per cent level. 'I’he smaller the level of significance, 
the more convincing is the disproof of the theory. 

More frequently we require to know whether two samples can 
reasonably be supposed to have the same mean, or the same variance, 
or whether they are drawn from the same population. Suppose that .v' 
is the mean of the first sample, and .v" tliat of the second. If x has 
standard error a', say, i.e. a variance o'- in repeated sampling, and .v" 
has standard error a", then .v' — x" will by formula (20.35) have vari¬ 
ance a'2 -f- o H- (— 1)^ o"- -- o"^ H- a"^, and therefore a standard error 

But if the two samples are derived from distributions 
with the same true mean, x' — .v" will on tlte average be zero. So this 
assumption is contradicted, i.e. the distributions have significantly 
different means, if \x' — x"\ exceeds 2\ (a - a'-). 


EXAMPLE 

(0 7^37 male babies have an average weight of 7 27 I -02 lb, and 
6693 females a mean weight of 7 06 j -02 lb (Karn and I’enrose). Is 
there a significant difference? 

The difference in means is 7 27 — 7 06 • - *21, and its standard 
error is -f *02“) = -03. 'I'hus the difference amounts to about 

seven times its standard error, and is highly significant. 


In general, if h' and h" are two estimates, derived from separate and 
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independent samples, of what is believed to be a single parameter rf 
common to both samples, we shall calculate the value of \h' — h''\ 

and treat this as normally distributed with 2ero mean 
and unit variance. 


FURTHER EXAMPLES 

(2) In early experiments involving the segregation of the genes for 
“normal” and “grey lethal” in the mouse. Dr. H. Griineberg found 
469 normals and 100 grey lethals. In later experiments he found 189 
normals and 74 grey lethals. Are these proportions in reasonable agree¬ 
ment? 

The estimated proportion in the first set of experiments is P' — 
100/(100 + 469) = 100/569 = *176, with standard error estimated 
as a' = \/[P'{i — P')ln’'\ = '\/[*i76 X -824/569] = ^ 000255 = -0160. 
In the second set the estimate is P" = 74/(74 + 189) = 74/263 = *281 
with standard error \/t‘28i X -719/263] — •\/'000768 = -0277. The 
difference between the proportions is therefore P' — P" — -176 — -281 
= —*105 with standard error a"-) — \/{'0002$$ -h -000768) 

— -032. This difference is therefore more than three times its standard 
error in absolute value, and is highly significant. 

We also see that the proportion -176 ± -016 obtained in the early 
experiments differs from the theoretical proportion of -25 for a recessive 
by 4-5 times its standard error. This may be plausibly interpreted as 
meaning that a large proportion of grey lethals are lost before birth. On 
the other hand, in the later experiments the observed proportion, *281, 
differs from -25 by only a little more than its standard error -028. Thus 
the Mendelian expectation is then realized. 

(3) Dr. S. B. Holt found the correlation between the ridge-counts 
of the finger-prints of the thumbs of the right and left hands to be -61 in 
TOO males and -55 in 100 females. Do these differ significantly? 

It is best to use Fisher’s xr-values, z == tanh”^ r, with standard error 
i/vX« — 3)- For males z' — -71 rh i/V 97 » for females = *62 ± 
i/\/97 . Thus the difference z' — z" = 09 is less than its standard 
error \/(yV ■+- tfV) = ‘^4» ^^d is accordingly not significant. 


\\"hen we have in this way two or more estimates, h\ h'^, h '",.. . 
of a single parameter rj, derived from independent samples, we can 
combine them to obtain a better estimate by weighting them by the 
reciprocals of their respective sample variances 

h = -h h^ls"^ -h /z” 7 ^'” 2 )/(i/j '2 + 1/^2 i//" 2 ) . (21.2) 

where s\ j", and s"' are the standard errors of h\ h", h'” respectively. 
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The standard error of h is then + ijs"'^ + lA If we sup¬ 

pose that there is the same true correlation between the finger counts of 
the right and left thumbs in males as in females, we can estimate it by 
combining the ‘V’ values. 

2r = {z'is'^ -f- 

= (-71 X 97 -f- 62 X 97)/(97 + 97) = -665, 

with standard error i/\/(97 + 97) = 07. r ~ tanh 2 = -58. 

21.4 Chi-squared 

The tests described above are valid for large samples; for small 
samples they will be inaccurate to a smaller or larger degree owing to the 
approximations involved, chiefly in taking the distribution of the esti¬ 
mate to be normal. A considerable amount of research has been done, 
especially by Fisher and his school, on the problem of constructing 
tests which can be safely used for small samples. Here we mention 
only three tests which cover a very large range between them: with 
suitable adaptations they can be made to cover almost all practical situ¬ 
ations. 

The first test is the test, originally due to K. Pearson, and im¬ 
proved by Fisher. The object of this test is to find whether the pro¬ 
portions observed in a sample are in reasonable agreement with those 
predicted from theoretical considerations. For example, in an experi¬ 
ment which was expected to give a i : 2 : i ratio, or which, speaking 
more accurately, should give such a ratio in a sufficiently large sample, 
the actual numbers observed in three classes, A, B, and C, were 28, 45, 
and 27 respectively, summing to n = 100. Can we consider this a 
reasonable agreement.^ 

The first step towards testing this point is the division of the total 
number, 100, in the theoretical ratio i : 2 : i, obtaining the numbers 
25* 50 and 25 respectively. 'I'hese are usually called the “expected" 
numbers. In the first class A the observed number is 28, the expected 
IS 25; the deviation from expectation is therefore 28 — 25 = 3. Simi¬ 
larly in Class B the deviation is 45 — 50 = —5, and iti the Class C it is 
27 — 25 = 2. Now clearly if our theory is correct, and the true ratio 
IS I : 2 : I, the observed numbers will be nearly the same as the “ex¬ 
pected” ones, at least in the great majority of cases, and the deviations 
will be small. So at first sight it would seem natural to add the devi¬ 
ations to give a measure of the total discrepancy between the observed 
and expected numbers. There are two objections to this. Firstly, some 
deviations are positive, and some negative, so if we add them as they 
stand we merely obtain zero [3 — 5 +2 = 0]. 'Phis difficulty can be 
overcome by squaring all the deviations to make them positive. Sec¬ 
ondly, we can reasonably expect the larger numbers to deviate more from 
expectation than the smaller ones; an unweighted sum of squares would 
be unfair on the smaller classes, as their contributions would be swamped 
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by those of the larger classes. This can be corrected by dividing the 
squared deviation by the expected number in each class. We therefore 
take 



deviation^ 

expectation 


(21.3) 


as a measure of discrepancy bet^veen theory and observation, 
example quoted 



I 02 


In the 


If y- is unreasonably large the theor>' will be disproved; but how large 
is “unreasonably large”? The answer is surprisingly simple, though the 
proof is complicated, and will not be given here. The value of which 
is to be considered significant depends (to a good approximation) only 
on a single number v which is known as the “degrees of freedom”. In 
the case of a straightforward comparison between theory and obser¬ 
vation this number v is simply one less than the number of classes, i.e. 
in our case i- ~ 3 — i — 2. The significance points for x“ for varying 
numbers of degrees of freedom are given in Appendix Table 6. In our 
case, V — 2, we see that y- is only significant at the -05 level when it 
exceeds 5*99; the calculated value i-02 is therefore non-significant, and 
gives no reason to doubt the agreement of theory and observation. 

'rhis use of x“ has one important qualification. It is really only an 
approximation, and cannot be considered reliable if the expected number 
in any class is less than five. In doubtful cases the approximation can 
be improved by subtracting I from the absolute value of each deviation 
before squaring: this “Yates’s correction for continuity” allows for the 
fact that the observed numbers are necessarily integers, and can only 
change by jumps of one. Thus in our example, instead of taking the 
deviations to be 3, —5, and 2, we should use 3 — ^ = 2*5, 5 — ^ = 4'5» 
and 2 — ^ -- 1-5, obtaining x* = {z-sYjT.^ + (4-5)2/50 + = 

•945, still non-significatu. 


PROBLEMS 

(1) In an generation involving the recessive genes for albinism 
and for congenital hydrocephalus in the mouse. Dr. H. Gruneberg 
obtained the following numbers: 

coloured normal 47 coloured hydrocephalus 13 

albino normal 17 albino hydrocephalus 4 

Do these agree with the expected 9:3:3:! ratio? 

(2) Gates and Pullig (1945) reported an experiment involving the 
genes for dominant spotting (ll'^ and recessive hairlessness (hr) in the 
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mouse. The cross was arranged to give a i : i : i : i ratio in the pro¬ 
geny if the genes were unlinked (i.e. not carried on the same chromo¬ 
some). The numbers obtained were: 

spotted normal 170 spotted hairless 124 

unspotted normal 131 unspotted hairless 180. 

Do these agree with expectation? 


Sometimes we may wish to know whether two or more samples 
agree reasonably well in their proportions, without having any previous 
knowledge as to what these proportions may be. For example, human 
beings can be divided fairly sharply into two classes—those who can 
taste phenylthiocarbamide (P.T.C.) in very dilute solutions, and those 
who can only taste it in strong solutions. Four different investigators 
found the following results. 


Table 21.2—Results of taste tests 


Investigators 

Population 

1 

Non*tasters 

1 

Tasters ^ 

1 Total 

1 

1 Proportion of 
non*iasters 

t 

Harris and 
Kalmus , 

English 

1 

54 (53-9) 

1 

1 

118(1181) 

1 

172 

3*'4 .h 3'5 

Falconer 

English 

161 (1971) 1 

46S (431'9) , 

1 h29 

25-6 dr 1-7 

Mohr 

Danish 

82 (78-7) 

169 (172-3) 

251 

32-7 d: 3 

Hartmann 

Danish 

222 (189-3) 

1 

382 (4147) 

604 ' 

36-8 i 2-0 

Total 

1 

5'9 

1 

1137 

1656 1 

31-3 : I-O 


(The numbers not in brackets are the observed numbers. 1 he figures 
are taken from J. Mohr, Ann. Eugen. Loud., 16 (1951), 288). As there 
is no a priori reason to expect any special [proportion of iKPn-tastcrs in the 
population, we estimate this proportion from the total figures, as 
519/1656, Similarly we estimate the [proportion of t.isters as 1 137 
We use these prtpportitpns to calculate “expected” numbers of non- 
tasters and tasters in each sam[ilc: e.g. in I larris and Kalmus s satn|ile of 
172 we expect 172 X 519/1656 = 53’9 n<pn-tasters, and 172 X 1137 
1656 = ii8-i tasters. 'I'hese expected numbers are shown in brackets, 
can now be calculated as liefore as T (deviation)','expectation : ,v" 

(S4-S3-9)’'/53-9 + (i 6 ' - ■9y)V>97' I-■ ■ ■, i (382 - 4.4-7)^’4'47 

= i8'8. It can be shown that we can still use the x” table lor the test 
of significance, provided that in no cell does the ex[pected v.ilue (ail 
below 5, and provided that we take the “degrees of freedom” i' to be the 
product (no. of classes ~ i)(no. of samples - i), i.e. in our e i:.e 

(2 — i)(4 _ i) = 3. Hut X' wit^t 3 degrees of freedom is significant 


BS 
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at the -oi level when > ii’35> thus the value x* = is highly 
significant. This means that the investigators disagree in the propor¬ 
tions of non-tasters they find. Mohr suggests that this disagreement is 
due to differences in the techniques employed, and not to marked racial 
differences between England and Denmark. 

If there are only two classes the calculation of x^ can be simplified 
as follows. Let Xi be the number in the first class in the first sample, 
y, the number in the second class, and Wj = Xi -|-yi the total number. 
Let X2, y2, W2 be the corresponding numbers in the second sample, 
.Vg, ^3, «3, in the third, and so on up to the Ath sample. Furthermore 
let X = x^ 4- ^2 + • • • + be the total number in the first class; 
Y = yi +^2 + . . . . + yk that in the second, and iV — + Wg + 

... 7 ik = X Y (check) the total sample number. Then, ignoring 
the correction for continuity 



(21.4) 


with (k — i) degrees of freedom. 


PROBLEMS 

(3) Apply (21.4) to Mohr’s data on taste-sensitivity. 

(4) Show algebraically that formula (21.4) is equivalent to the gen¬ 
eral definition {21.3) of x". 


In the particular case in which there are only two classes and only 
two samples the formula (21.4) reduces to 

= (-Vi>'2 — X2yi)~NIXYtiin2 - . (21.5) 

EXAMPLE 

(i) The following results concerning susceptibility to typhoid were 
found bv Greenwood and Yule. 


Table 21.3—Relation between inoculation and attack by typhoid 



! Attacked 

1 

1 

Not attacked 

Total 

1 

Inoculated . . 

5b (-Vi) j 

6759 Cvi) 

6S15 (Hj) 

Not inoculated 

272 (.Vo) 

11396 O'2) 

11668 (rtg) 

'hOTAL .. ..i 

328 ix) ; 

1 

18.5s (V) 

18483 (iV) 


X- = (56 . 11396 — 272.6759)2 , 18483/(328 . 18155.6815 . 11668) 
= 56-2 (i degree of freedom). 










SIMPLE STATISTICAL PROCEDURES 


627 

This is highly significant, indicating that the proportion of sufferers 
from typhoid is really different in the inoculated persons from its value 
in the non-inoculated. Indeed the proportion is lower among the 
inoculated. Assuming (as seems reasonable) that the sample has been 
properly selected this indicates that inoculation provides some degree of 
immunity, though by no means a complete protection. 

As we have said, this method applies only if all expected numbers 
exceed 5. Fortunately a complete solution is known for the 2x2 table, 
and tables have been prepared by D. J. Finney {Biometrika, 35 (1948), 
^45~*5^) cover the cases of small numbers in the cells. 

A modified heterogeneity test has recently been proposed, applicable 
to cases in which there are three or more samples, and any number of 
classes (C. A. B. Smith, Ann. Engen. Land., 16 (1951), 16-25). Suppose 
we have k samples divided into c classes, A, B, C, . . . Let .\v, Vr, -r • • • 
be the numbers of cases in classes A, B, C, . . . respectively in the rth 
sample, and ?ir = -h yr + + • • • the total number in the rth 

sample. Also let X = Zxa be the total number in class A in all samples; 
Y, Z, . . . are similarly defined, and A' = AT -j- Y Z + ^ Erin 

is the total number in all samples and all classes. We can calculate the 
expected number HrXIN in class A and sample r, exactly as for 
the difference (observed — expected) will be the corresponding deviation 
from expectation. For greater accuracy this deviation can be corrected 
for continuity by reducing its absolute value by i, if so desired. Simi¬ 
larly we can calculate all the deviations in class B, and so on. Instead 
of we now calculate 


^ _ .^(deviations in class A)^ 27(deviation^ in-class B)- 


X 


Y 


is accordingly, an alternative measure of discrepancy between theory 
and observation. We also calculate the following quantities 


N2 = Efia^ = //,“ n.,^ -I- . . . -f n,t^ 

Ns = Efta^ ---■ -f- -t- • . . -l-w*'* 

^ = i/X ilY + ilZ ^ .... 

y = (c - I ) (A^2 _ N2)!N{N - I) - (r i) (• - A^/A^) 

if A' is large 




= [(A 2 - Ns) (N. - A') (C - fi - i \ A-') 

-2 (AA3-A2^){c-2^-i -r (c I f“).V-‘}]''A’^(A 6 ) 

- (c-i)[(A“-A2)(A2-A)-! 2(A'2—.AA3) , A> 

if .\’ is large 


«= \/(y^ — ^) ^ ■ V(y • • i^^-l) 


If the continuity correction can be neglected 
find V' from the formula 


it is also possible to 


'E = ExaVX -f- EyaV y + EzaVZ -i- 


-N.JN . . (21.8) 
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We test the significance of ^ as follows: the significance of the sample 
is approximately equal to the probability that a standardized normal 
variable shall not lie between (— — V^)IC and 

in terms of the normal integral P{X) (Appendix Table 4) it is 

3 - P 

More exactly, if this expression is less than -047 we can safely conclude 
significance at the 05 level, and if it is less than 0074, then ^ is signifi¬ 
cant at the 01 level. In practice this usually amounts to saying that ^ 
has *05 significance if (V^— V^)IC > i'7 and -oi significance if 

V^)IC > 25. 

'I’hus for the taste-testing results given in Table 21.2 we have 
N = 1656, c — 2, 


A'2 = 172- -f- 629- + 251“ -r 604- = 853042 
A'3 172® -f 629^ -h 251^ -f 604^ = 490108752 

^ = (54“ + -f 82- + 222-)/5i9 

+ (ri82 -f- 468- -f- 169- 4- 382“)/! 137— 853042/1656 
= 6-696 
\ T = 2-578 

/S -- 1/519 -r 1/1137 = -002806 
y -6894 (-6889 by approximate formula) 

S = -1918 (-1916 by approximate formula) 
c = -v/(y2 — S) = -522 C = \'{y — e) = -409 

{\'^' — V^). C = 4'54* This considerably exceeds 2-5, and is therefore 
highly significant. 

I liis test is rather more laborious than y-. But it is probably much 
less liable to error when there are small numbers in some of the cells of 
the table; and it can also be more sensitive to the presence of hetero¬ 
geneity. It has not yet been carefully studied. 


:\()te .—When there are only two samples this test is essentially 
equivalent to y-; in such a case x~ ''’iH he both more accurate and 
easier to calculate. The same applies when there are 
Hr in all samples. 


equal numbers 


21.5 Differences between means 

Sometimes it is necessary to find whether two or more small samples 
can be reasonably considered as being derived from a single distribution. 
We usually test whether the means differ by no more than can reason¬ 
ably be accounted for by random fluctuations, since a difference between 
two distributions usually shows itself most clearly as a difference 
between their means. 

'I'he appropriate technique for this test is the simplest form of the 
“Analysis of Variance”, and proceeds as follows. 
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Suppose that there are n' observations in the first sample, say .v'j, 
a; 2» • • . . x'n>. Let the total of these obser\'ations be 7 ’' = at'i + x'2 -f 
... + a; and the mean x' = T'jn'. (This mean should be calculated 
to a fair number of decimal places, since it is usual for a number of 
places to be lost by subtraction in the course of the calculation.) Simi¬ 
larly let^the second sample contain n" obser\'ations, with total T" and 
mean x" = T"!n"\ we proceed in this way with each successive sample. 
Also let T — T' T" + T'" + ... be the “grand total” of all the 
observations, N ~ n' -f- n" + tt"' + . . . their total number, and x = 
T/N the “grand mean”. We shall suppose that there are k samples in all. 

We now calculate the following quantities: 

= AP'l" + X' 2 ^ + . . . + .V'',2 + 4- . . . + 

i.e. is the sum of the squares of all the observed values. 


/' / o 
o 


Z2 = x'T -H rT" + x"'T"' + . . . ; 


I. = xT. 


Finally we calculate 


the “within samples mean square” = (Tj — Tj) (.V — 

with 1/2 = (A^ — k) “degrees of freedom”; 

the “between samples mean square” msqn — (2*2 — ^s)!{k — i), 
with v^ = {k — i) “degrees of freedom”; 

the “variance ratio” F — msqulmsqw . . . (21.9) 

The phrase “degrees of freedom” here indicates a certain number 
associated with the mean square; we shall not discuss here whv this 
particular form of words is used. 'I'he test ol significance is then per¬ 
formed by looking up the appropriate significant \ alue of F in a special 
table (Appendix I'able 7), where it is given for the ap|>ropriate values of 
I'l and V2. If F is greater than this value we can conclude that the sam¬ 
ples are derived from distributions with differing means. 

We can perhaps see why this test should work l)y considering the 
special case in which there arc just two means being compared, x and 
x'\ so that k = 2. In that case we know that the deviance, or sum of 
squares of deviations from the mean, is equal to 







in the first sample, and to the corresponding expression in the 

second sample. It follows from the definition of 2’, and 2’o that 2*, - 2'o 

= S'gx + and therefore the mean square within samples is by 

definition 


= (Z-j + - k) ------ {S’,x 4 .V'V.)/(«' i - 2 ), 

with V2 — n' -f- fi" — 2 deg rees t)f freedom. Also we know that the 
grand total T'must be the sum of the individual totals for the two .sam¬ 
ples, i.e. T = T' -\- T" ~ n'x' 1 t^x", and the grand mean .v T .V — 
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{n'x‘ + ■tfx”)l{n' + ; therefore the mean square between is by 

definition 

= (^2 — 

= x'T' + x"T" _ xT 

= n'x'^ + - {n’x' + n"xyi(n' + n") 

_ {n' + n") {n’x'^ + n''x"^) — (n'x' + n^xy 
~~ n' 

and after a little manipulation this reduces to ii'n^ix’ — xyi{n' + ^0* 
Therefore 


F = msqBimsqw (by definition) 

= n'n\x' - xy («' +71" ~ 2 )l(n’ + n") ( 5 '„ + 5 ",*) 

Let us write y/F — t ; then 



This expression is known as “‘Student’s’ /, with v^—n' + — 2 

degrees of freedom’’. It is accordingly equal to the square root of the 
variance ratio F when only two means are being compared, i.e. when 

= 1. 

However we can interpret this expression as follows. Suppose that 
in each of the two samples the true standard deviation is a. Now we 
know that S'^rlin' — i) is the sample variance v' for the first sample, 
and will therefore approximate to the true variance, i.e. S'xx will 
approximate to («' — Similarly •S"** will be approximately 

{h" — I)CT^ and S'xx + ^ {n' -f n" — z)a^. This means that the 

/S' S'' \ 

expression —^2 )* occurs in the denominator of /, is 

in reality simply an estimate of the variance a® obtained by combining 
the two samples. But x has variance a-Jn', x" has variance a-fn", and 
therefore x' ~ x" has variance ts%ijn' + i/w"). and standard error 
+ ijn"). Thus we can write: 

^ _ difference (.v ' — x") between the two means 
estimate of the standard error of this difference. 


If both samples are large the estimated standard error, in the denomin¬ 
ator, will be practically equal to its true value, a\/(i/»' + i/”")* ^nd t 
will accordingly be an ordinary' standardized normal variable; it will be 
level). But if the samples are small this approximation breaks down, and 
judged significant if |/[ > 1-97 (at the -05 level) or if \t\ > 2-58 (at the -oi 
a table has, again, to be used to judge the significance of t (see Appendix 
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Table 7). Alternatively we use = F for = i and vg = n' + n" —2 
degrees of freedom. 

Strictly speaking this test is only valid if all the distributions are 
normal in form and have equal variances. But it is not seriously in¬ 
validated by any mild departure from these conditions. 


EXAMPLE 


(i) The following values were obtained for the threshold concen¬ 
tration of phenylthiocarbamide which could just be tasted by 114 males 
and 100 females (H. Harris and H. Kalmus, Ann. Eugen, Lond., 15 
(1949), 29). 


Table 21.4—P.T.C. thresholds for males and females 


Threshold 

concentration 

0 

1 

! I 

2 

1 

1 

3 

4 

5 , 

1 

1 

6 

7 

1 

1 

1 

1 

1 8 

9 

' 10 

I I 

1 

1 

1 1 

1 

1 12 : 

1 

»3 1 

1 

1 

; J 4 

1 Total 

No. of males 

3 

14 

6 

9 

4 

' 1 

3 

2 

1 

6 

14 

20 

21 

6 

3 

3 ! 

' 0 

114 

No. of females . . 

5 

4 

5 ' 

7 

7 

1 2 

1 

2 1 

I , 

■ 5 

1 21 

1 ' 

12 

4 

3 

I 

ICO 

Total 

8 

18 

I I 

16 1 

X 1 

5 

4 

7 

19 

141 ^ 

' 

> 1 

18 

1 

7 i 

1 

1 

6 ^ 

I 

214 


Is there a significant difference between males and lemales? (Note, the 
concentration here is measured on an arbitrary logarithmic scale, o 
corresponding to the strongest solution a>id “14 to the weakest. Tor 
details see the original paper.) 

Here there are eight persons with tlireshold o, eighteen uith 
threshold I. and so on. 'I’he sum of squares of the observed values is 

therefore 


= 8 X o2 + 18 X i2 4-.... d- I X 14= = 14127- 


Similarly the total of observed values for males is 

r' = 3 X o + 14 X I -i .... ox 14 = 781 

and the mean is -781/M4 = 6-85088^ For females 

T" = 764, x" = T'fn" = 7-64000. So -- T x j- T x — i r iSyS- 

Finally the grand total T= T + T’ - i 545 . '' = 

1545/214 = 7-21963, and ^3- Tx= 11138-9- 'Hiusthc mean square 

within samples” is msqw = 

= (14127-0 — 11 i87-5)/(2I4 “ 

= 13-9 with {N — ft) ►'2 -' 212 d.f. 

The ‘‘mean square between samples” is nisqn = (-^2 


= (11187-5 — 11138-9)/* , . 

= 48-6 with {k — 1) *'i - * 





632 

and the variance ratio is 


BIOMATHEMATICS 


F = msqsltnsqw = 3-50 

We have no entry in the F table for Vj = i and V2 — 212 exactly; the 
nearest entries are vi — i, vg — ^20, F = 3-92 at *05 level, and Vj = i, 
vg — 00, F — 3 84. The observed value is smaller than either of 
these, so it cannot be considered as significant. But it is not much 
smaller. This suggests that there probably is a sex difference, but fur¬ 
ther investigation would be needed to establish its existence. 

21.6 Maximum likelihood 

There is apparently a dominant gene which causes the blood of 
its possessors to agglutinate when tested with a certain special serum 
(S. T. Callender and R. R. Race, Ann. Eugen. Lond., 13 (1946), 102-107). 
Those persons who have this gene are said to be of “Lutheran” blood- 
group, or to be “Lutheran positive”. Thus if the corresponding reces¬ 
sive gene is Lu^y the Lutheran positives have the genetical constitution 
Lw® Zz/® or Lw® Lu^y and the Lutheran negatives are Lu^ Lu^. Callender 
and Race found 46 Lutheran positives out of a sample of 582 persons 
tested. 

Let us suppose that the proportion of Lu^ (recessive) genes in the 
general population is A: then the proportion of Lu° genes will be (i — A). 
If mating is at random the proportion of Lu^ Lu^ or Lutheran negative 
individuals will be A-; for the chance of obtaining an Lu*’ gene from 
either parent is A, and these probabilities are independent, and can 
accordingly be multiplied. We would like an estimate of A from our 
observed proportion of 582 — 46 = 536 Lutheran negatives out of 582 
persons tested. 

An obvious estimate / of A is obtained by putting the theoretical 
proportion A- of negatives, estimated by /-, equal to the actual propor¬ 
tion 536/582; this gives /- — 536/582 = •921, / = \/’92i — -960, i.e. 
96 genes out of ever>' too are of the recessive type. But it is not 
immediately obvious what is the standard error; nor is it absolutely 
clear that this is necessarilv the most accurate estimate possible. 

I'o overcome these difficulties R. A. Fisher elaborated a method of 
estimation, called the “method of maximum likelihood”, which has the 
advantages that it is of very general application, and that in large sam¬ 
ples it is always the most accurate method possible and has a known 
standard error. 

I'he first step in this method is to find the expression for the probabil¬ 
ity of obtaining the sample actually observed. We know that the chance of 
obtaining .v events of kind A and _v events of kind B out of 7i = -v y 
events altogether is = \n p^q'^l\x\^y where p is the probability of any 
given event being of type A, and q = i p that of being of tj-pe B. In 
our case we take the event “. 4 ” to be the selection of a Lutheran positive 
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individual for test, and event “ 5 ” that of a Lutheran negative. Then at, 
the number of positives, is 46, and /), the probability of obtaining a 
positive, is i — A^; similarly j = 536, q = A^, and n = 582. So 

Pxv^ 1582(1 — A^)*^(A^)S36/|46[536 . . . (21.10) 

Now the quantity A is in theory a perfectly definite number. It 
could be determined if we had sufficient sera to distinguish all three types 
Lw® Lm®, Lu^ Lu^ Lu^, and if we could test the whole British popu¬ 
lation. Unfortunately we have neither a serum which will distinguish 
between Lu^ Lu"^ and Lw® Lu^y nor have we the time and resources to 
make an extensive survey; so A is necessarily an unknown quantity. 
However various hypothetical values of A can be tried and the corres¬ 
ponding values of calculated from equation (21.10). Thus we obtain 
a graph of p^y plotted against A, which is shown in Fig. 21.2 (only the 



Fig. 21.Z—The probability pxy of obtaining x = Lutheran positives 
out of X y = 582 individuals for various hypothetical frequencies A 

of the Lu^ gene 


range of values from A = 0 9 to A = 10 is shown in the figure, for tl»e 
remaining values, from A = o to A = 0 9, p^y is very small.) We notice 
that this graph has a maximum point around A 0-96, which was our 
previous estimate of the value of A. In fact we can show that the maxi¬ 
mum occurs exactly at our previous estimate. In order to simplify the 
argument it is convenient to consider not p^y but its natural logarithm 

L = In p^y = In (1582 /[46153b) T 46 hi (i — A-) -f- 536 In (A-) (21.11) 

This does not affect the discussion, since the probability />,„ will neces¬ 
sarily be a maximum when its logarithm L is also a maximum, and vice 
versa. Now we know that when L is a maximum its derivative 

DaL == 46(-2A)/(i - A^) -f- 536(2A)/A'^ - U (say) 
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is zero. That is, if / is the value of A for which L is a maximum, 

46(-2/)/(i — + 536(2/)//=* == o. 

We can take out the common factor 2/, which cannot be zero (since / — o 
would be an absurd estimate of the value of A), so 

46/(1 -/2) - 536//*, 

46/2 = 536 (i — /2) = 536 — 536/2, 

/2 = 536/582 or / = V(536/582) = -960 

exactly as before. 

To find the error of the estimate / we proceed as follows. We find 
the quantity /= — D>?L — — D\U by a second differentiation; after 
some simplification this reduces to 

I = 92/(1 - A 2 ) + i84A2/(i - A 2)2 + 1072/A* 

and we substitute in this the estimate I — *960 of A, obtaining I = 
2'99 X 10*. The error variance of / is then approximately v — i//, and 
the standard error is s = - y/v = ij^^I — -0058. (For proof, see Section 
21.7, example 2.) Thus the true value of A can be expected to lie 
between / — 2s = 948 and I zs — -972. 

The quantity I has been named by Fisher the ‘‘quantity of infor¬ 
mation about A in the sample”. As far as it concerns us here it is 
nothing more nor less than the reciprocal of the error variance v of the 

estimate/. / itself is called the maximum likelihood estimate. It can be 

defined as that value of A for which the probability of obtaining the 
observed sample would be as great as possible. 

For the sake of clarity of thought, note that we do not say that / is the 
"most probable value” of the unknown parameter A. In fact A has only 
one value, which is its true value. A probability, in the technical sense 
in which we use the word, is a relative frequency or proportion. We 
cannot apply such an idea to A. If we repeat the sampling several times 
over, we shall not obtain a proportion of times in which A takes one 
value, and another proportion in which it takes another value; on the 
contrary we suppose that the true value of A is the same for all samples, 
although the estimate / may vary from one sample to the next. It is true 
that in the above argument we have imagined what would happen if we 
took various values of A, as indeed is inevitable in framing a definition 
of a "maximum likelihood estimate”. But this is a purely hypothetical 
variation; it exists in the imagination. It could conceivably happen in 
reality; if we took samples from various countries, such as France, 
Spain, Belgium, etc., we might well find that the frequency A of Lu^ 
genes varied from one country to another. But such a variation is not 
essential to the argument. Some wTiters do indeed talk of the "most 
probable value of A”; but this is either a careless use of language, or else 
they are using the word "probability” in an unusual way, and one which 
is not at present generally accepted. 
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Let us express this method in general terms. Suppose that in a sam¬ 
ple of n observations in all, we find x of kind A, y of kind B, z of kind C, 
and so on; so that -v + j -f- z Suppose further that we know 

that the true probabilities, p, 9, r. . . of observations falling into cate¬ 
gories A, B, C. . , respectively depend in a known way on a single 
parameter” 0 , but that the true value of 0 is unknown. That is to say, 
p is some known function P(^) of 0 , and the same applies to g =q (0), 
and so on. To avoid ambiguity we shall denote the true value of 0 
by the symbol 0*, and the corresponding values of p, q, r, . . i.e. 

etc., by the symbols />*, q*, r*, . . .respectively. 
Without the star affixed these symbols stand for hypothetical values of 
Pt . . . With such values the probability of obtaining the sample 
actually observed will be 

= \np^q''T^ -/| 5 l^i£- 

and the natural logarithm of this probability is 

L = K-\-x\Tip-\-y\Tiq+z\nr (21.12) 

where K ^ ln( [w/j.vp'|5r. . .) does not involve the unknown parameter 
0 . It follows that the derivative Lq = DqL — V say is 

^ = xpdp H-jW? + “''o/'' + • • - . (21.13) 

where pQ := Dap. The maximum likelihood estimate of 0 will be that 
hypothetical value of 0 for which L is a maximum; for this, 0 must be 
a solution of the equation U =■ Lq = o. This solution we will call 
It may happen that there is more than one root of the equation V — o\ 
but if so it will usually be found that all roots but one can be ignored, as 
providing quite absurd values of 0 . 

If we repeat the experiment by choosing another sample of n obser¬ 
vations we shall obtain another estimate t of 0 , and by repeating the 
experiment a sufficient number of times we shall have a distribution ot 
estimates. Fisher has shown that for large values of n the distribution 
approximates to a normal form with mean very nearly equal to the true 
value 0 * of <?, and variance approximately —i//, where /= —Vq - 
—Loo. From equation (21.13) differentiation 

JJq =-- .v(/)/oo/> — p^~ip'^) + yiqm'.q — + • • • 

But when n is large, .v will approximate to «/>, y to riq, and so on, and 
therefore 

Uo ^ Ti(p„o — po~!p) -f ri{qw — <7o':q) T ■ • • • 

— + ^^00 4' ■ ■ • pi)~lP 7o"/7 ■ * •) 

But we also know that p ~\- q r — i, independently of the 

value of 0 , and therefore on differentiating both sides of this equation 
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with respect to 6 we obtain + To + . . . = o, and by a second 

differentiation, />oe + ^00 + t'oo + . . . = o identically. This fact sim¬ 
plifies the formula for Uq\ and we have finally 

I = - n{pQ^jp + qQ^jq +...). . (21.14) 

Thus we have two alternative formulas for /, viz.: ~Uq and 
^(P^^IP “b +•••)• Either of them can be used to calculate the 
standard error, 1/V/, of t. For either formula is strictly speaking, only 
an approximation, true for large samples; and in large samples — Uq 
and fiipa^ip + q'^jq + . . .) are very nearly equal. However the form 
^{P^^tP + 90V9 + . . •) has the advantage that it does not involve the 
observed numbers .v, y, a*. . . separately, but only the sample number 11 
and the estimated parameter 6 . 

From (21.14) see that / = — Uq= —Lqo is necessarily positive 
in a large sample, i.e. is negative. If follows that the solution of the 
equation U = Lq = o corresponds to a maximum and not a minimum of 
L (see Section 12.5). This justifies the name “maximum likelihood’*. 


EXAMPLE 

(i) In a linkage experiment with maize, involving the self-fertiliz¬ 
ation of a plant of genetical type Ab'aB^ the following proportions were 
obtained: 32 (= .v) double recessives aabby 906 (= y) of type Abb (i.e. 
Aahb or AAbb, since A is dominant), 904 (= xr) of type aaB, and 1997 
(= w) of type AB, making a total of n = 3839. (Here A denotes the 
gene responsible for starchy endosperm, a = sugary, B = green base 
leaf, and b ~ white. I’he experiment is quoted by Fisher in his 
book, Statistical Methods for Research Workers. Now a recessive aabb 
can only arise bv each parent contributing both an a and a b gene. This 
can only happen if there is a recombination in each parent, and if more¬ 
over the chromosome handed down is that containing the a, b genes, 
and not the A, B genes. If c is the recombination fraction in the male, 
and c' that in the female, the chance of this occurring is p = . if' = 

,Jrr' \ 0 say, where B ^ cc' is the product of the two recombination 
fractions. 'I'liis is therefore the expected proportion of aabb progeny. 
Now the recessives hb are expected in a total proportion of by Men¬ 
del’s laws, and so if q denotes the expected proportion of Abb indivi¬ 
duals, we must have q --- i —p — 4 — “ d(i — ^)- Similarly r, 

the expected proportion of aaB, must also be d(i — ^)i ‘*f^d if s is that 
of AH, since 7 + r ^ i, we must have y = d(2 + 6 ). We 
now wish to estimate 0 from a comparison of these expected propor¬ 
tions with those actually observed. The maximum likelihood equation 
of estimation is (equation 21.13) 


I ’ .\poip 4- yqo'q + zro r + tvso’s = o 
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where/* q = r = |(i 6 ), s = ^(2 + 0), whence by differenti¬ 

ation Po = Sq = qQ = tq = ~l. By substituting the observed 
values ot x, y, z, w, this gives the estimate t as that value of 6 for which 

32/0 — (906 + 904)/(i — 0 ) 1997/(3: + 0) = o 

On simplifying this equation and solving it we find / = 0357- The 
value of I is obtained by differentiating U again with respect to 0 , ob- 

^ = 32/H- (906 -h 904)/(i — ^)- + 1997/(2 + 0 )~, 

and then substituting in the estimate t = 0357 for 0 . We find the 
standard error of t to be ij\/I = 00583. 

21.7 A general method of estimation 

Sometimes the Maximum Likelihood method is awkward to apply. 
It may then be better to use a simpler method, which will not always be 
cpiite so accurate, but which will lead to easier and quicker calculations. 
This IS often a question which can only be resolved by consideration 
of the particular experiment involved. Is it better to spend time on 
heavy calculation, with the prospect of some gain in accuracy in the 
final answer? Or is it better to spend the time in repeating the experi¬ 
ment, and thus increase the accuracy by having more observations? It 
must also be remembered that although the maximum likelihood 
method reduces random error to a minimum, that may not be very 
important if there is a systematic bias in the sample, as can sometimes 
be unavoidably though very regrettably the case. 

A very general method of estimation is the following one. We shall 
use the same symbols as in the preceding section, and also consider the 
same kind of situation. We shall also denote by P the observed propor¬ 
tion x/n falling in class A in the sample, and similarlv Q - 3’, w. R - 
etc. We take any reasonable function '/^(T, Q, R, . . . 0 ) of the 
observed proportions and the unknown parameter 0 , and set it equal to 
the corresponding function of the expected proportions />, <7, r, . . . and 
of 0 . That is, we take as our estimation equation 

Q, R, . . . 0 ) '/'(/>, q, r, . . . 0 ) . (21.15) 

Since p, q, r, . . . can all be expressed in terms of 0 , this can be con¬ 
sidered as an equation involving 0 ; its solution will be the estimate we 

require. 

As an example we might take the function 'J'(P, (), R, . . . 0 ) to he 
simply P itself, so that the equation becomes P />. In terms of the 
genetical example we have considered above, P is the observe<l fr.iction 
of double recessives aabb, i.e. P xjn 32/3839, while /> is the ex¬ 
pected fraction \ 0 . 'I'hus we have the estirnalion equation 32/3839 = 
the solution of this is the required estimate t' (say) - 4 X 32/3839 
jr '®333 ■ iu this case this is not appreciably dilferent from the maximum 
likelihood estimate *0357. 
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The maximum likelihood formula can be considered as a particular 
case of the general formula (21.15). For let us take our function W to be 
l/jn = Ppjp + QqQlq -f- . . . . Then 

¥^0, q^r,...d) = PPqIp + qqelq + . . . 

(by substituting p for P, q for Q, etc.) 

= pe + 96 + • • - 
= o, since p -|- j + . . . = i. 

So the equation ¥'(P, - 0 ) = ^{py 9, ... 9) reduces to Ujn = o, 

i.e. U = o, which is the maximum likelihood equation. 

Now when we have a large sample the obser\'ed proportions P, Q, 
R, etc., will not be greatly different from the true proportions/>*, q*^ r*, 
. . . and we can reasonably expect the estimate t' (say) of d derived from 
equation (21.15) nearly equal to the true value 6 *. (This can 

indeed be shown to be true under very general conditions, but the 
rigorous proof involves more advanced theory.) Let us therefore write 
t' — 6 * — S/', so that ht' is simply the small error in our estimate t'. 
Furthermore let />, q, r, . . . denote the proportions expected in the 
various classes corresponding to the hypothetical value V of B, and let 
us write/» — p* = Sp, etc. Then the equation of estimation for t\ i.e. 
¥'(P, Q, R, . . . 9 ) = H^{p, q, r, . . . 6 ) can be written approximately as 

5P + V'o* 5)3 4- . . . -f ^0* 5r 

= 'P* Pp* Bp H- Pq* Bq + Pq* &t' 

since BP, BO, Bp, Bq, etc. are all small (see Section 9.3). Here P* is the 
true value P {p*, q*, r*,. . . 9*) of P, and Pp* means the value of the 
partial derivative Dp P{P, Q, R, . . . 9 ), keeping Q, R, . . . 9 constant, 
when P — its true values p*, Q = q*, etc. The terms P* and 
cancel, and so 

V'p* 5P + V'o* S)3 + . . . = Pp* Bp + Pq* hq -P .. . 

= Wp*pQ* 8i' + Pq* 9o* sr + . . . 

where Po* denotes the value of the derivative DqP when 9 takes its true 
value 9 *. Here Bt' is a common factor of all the terms on the right- 
hand side, and can be taken out, giving 

Bt- = t- ~ = (Pp* BP-\-Po* 8 Q + . . .)l(^P*po*^^Q*qQ* + • - .) 

(21.16) 

'I'his formula gives us, to a first approximation, the error Bt' in our 
estimate t' of expressed in terms of the differences SP, . • • be¬ 
tween the observed proportions P, Q, . . . and the corresponding true 
proportions p*, q*, ... . Note that all the symbols, such as Pp* pQ*, 
occurring on the right-hand side are by definition constants, except for 
these differences SP, SO, etc. They are unfortunately unknown con¬ 
stants, so the formula cannot be used in practice as it stands. But we 
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nevertheless obtain some useful conclusions. In order to do so it is best 
to rearrange the formula algebraically in the following way. Let 
-j- Wq* q* = J (say): then 

Y'p* SP -f SJ 2 + . -. = + ^o(Q -?*)+... 

= ^P*p + + ... -y(P + 2+..) 

= {^P*-J)P + {^o*-J)Q+... 

= aP + bQ + . . . . . (21.17) 

where a = !Pp* —b = — J, ■ ■ • and so on; these numbers a, 

b, . . . are constants, though unknown. Furthermore, since p^* + q^* 
+ . . . = o we have 

^^0* + + . . . = i^p* -m* +{^Q* - 7 ) 90 * + . ■ . 

— 'Pp*Pq* + ^C>* 9 e* + • . . — JiPa* + 9 o* + • •) 

= ^p* pQ* + ^Q* q*f) + . . . 

So that equation (21.16) can be alternatively written as 

bt' = {aP + bQ + . , .)i{apti* + bq^t* + • - ■) - (21.18) 

Now consider the numerator w = aP H- hQ ^- . . . If we repeat the 
experiment or sampling process many times, we shall obtain a distri¬ 
bution of values of P, Oy R, . . ., the observed proportions, and there¬ 
fore a distribution of values of w, and of ht' = 'lvl{op^i* 4 bqQ* -f . . .). 
By (20.41) the mean value of w in this distribution will be 

= ap* + bq* 4- cr* 4 ■ • • 

= {^p*-y)p*+{^'Q* y)q*^\ + • -. 

= ^P*p* + Po*q* 4- 4- . . . -y{p* \ q* . .) 

= y~y==o. 

The variance of lo is therefore, also by formula (20.41), 

= {a-p* + b~q* 4 - c-r* 4 - - • ■)/»• 

It follows that ht' = zvf(ap(j* 4- bqo* 4- . . . ) has mean zero and 
variance 


u = (« 2 p* + b^q* -I- c^^r* 4 - . . .)/«(«/>o* + f^qo* 1 t • • •)" (21.* 9 ) 

In other words the estimate /' is on the average e<jiial to tlie true value 
0 *, and its variance is equal to the variance of St' -- /' — 0 *, since the 

addition or subtraction of the constant 0 * does not affect the variance. 
So ti is the error variance, and n = the standard error of the estimate 
t'. Unfortunately we cannot know the true value of (f* as a rule, and 
formula (21.19) have to be used in practice with the estimated value 
t' of 0 instead of the true value, i.e. the estimated error variance will be 

V = (a^p + b^q + c^r + . . .)/fi(apo 4 - bqo 4 - + • • O' • (21.20) 
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EXAMPLES 

(1) We obtained an estimate of 9 in our linkage problem by taking 
W(P, Q, R, S, 9) = P. This gave t' = *0333. What is its standard 
error? 

We have 'f'p = i, = <=>> whence = i, Wq* 

= Wp* = ^s* == o. So 5 = Pp* P* + q* + 

= k9*, anda= ^p* -J = I - I9*,b = c = d^ -^9*. Also 

pQ = 96 = ?'0 = —L ^0 = L so po* = h 96* = ^6* = —h Sq* = h 

Substitution in (21.19) gives 

[(I _ ie*y-p* + 

^ n[(i - ie*)\ - {\e^)^ - ( 19^)1 + ae*)\Y 

- + Mn^]MY 

= -^ 9 *u - ^*)/^«« ^ ^*(4 - 

Substituting in this the value -0333 of our estimate of 9* we find the 
error variance to be v = 0000344 and the standard error to be \/v 
= -0059. 

(2) Maximum Likelihood, Take P{p, g, . , . 9) to be Ujn = 

PpGp 4- !39 o ^9 Then ^p = p^jp, and therefore ^p* = po*!p*‘ 

So Pp* p* -I- Wq* + . . . = po* + 96 * + • ■ * = o> 

fore a = ^p* ~ J = />o*//’*. ^ = 9o*/9*> so on: a^p* + b-q* + • • • 

= (/>o*)' '/>* 4- (90*)-'9* + • . . = i*iny bv (21.14), and opQ* + ^>9o* 

4- • ■ — ipi^y^iP* + (9o*)V9* + • • • = i*h\- So hy (21.19), = 

(/*/«), ;;(/*/;/)- - - i /*. With the proviso that in practice we have to use 
the estimated value i 7 instead of the true value i//*, this justifies the 
assertion we made above that the error variance of the maximum likeli¬ 
hood estimate is i,I. 


21.8 Efficient estimation 

It was asserted above that maximum likelihood gave the best possible 
estimate of the unknown parameter 9 . We are now in a position to prove 
that statement, at least as regards the method of estimation discussed in 
the preceding section. In order to do this we take any quantity ^ what¬ 
ever. and form the sum of squares 



- {(i\ p* — ip(i*, \ p*)- 4 - (A\ '9*—^ 9 o*/\ 49 *)“ 4 - - ■ • 

[a-p*~za^p^*~re{p^*Y:p*] L 4-^W)V9*] T ■ • • 

^ {(i-p* vb'q* + . . .) —z^iapQ* +^ 90 * • ■ •) 4 - 


Since this is a sum of squares it must be positive (or zero) whatever value 
we take for f. Furthermore I* n — {pQ*YiP* "h ( 9 o*)" 9 * 4 - - • - will 
alwavs be positive, except in the special case in which />©* = 9 e* = . . . 
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= o, which we can safely ignore, as it never occurs in practice. Let us 
put ^ = (a/>o* + bq^* + . • this shows that the quantity 

S = + 6=9* +...)- 2 (ap^* + bqo* + ■ . •)=«//* 

+ (.apo* + bq„* + . . .Yn^I*ln(I*)- 
= (a^p* + *^ 9 * +-) - (ape* + *9o* + • • .)=n/'f* 

is necessarily positive or zero. Divide through by the positive quantity 
?i(apQ* -|- bqa* + . . .)2 (again assuming that this does not happen to be 
zero); we then see that 


{a^p* + b'^q* + . . . 






IS necessarily positive or zero. But v = {a-p* 4- b-q* + . . .) - 4 - 
tt{apQ* + bqa* + . . .)- is the error variance of the general estimate 
whereas ijl* is the error variance of the maximum likelihood estimate t. 
So i» — ijl* > o, u I//*, and the general estimate has an error at 
least as great as that of maximum likelihood. This proof does not ex¬ 
clude the possibility of obtaining other estimates which are just as good 
as maximum likelihood in large samples. Indeed such estimates can be 
obtained, and are frequently more convenient. By inspection of the for¬ 
mula (21.19) we see that the error variance of our estimate depends only 
on the values of a, b, c . . ., since the quantities />*. q*, po*, qa*, etc. 
are determined by the nature of the problem, and not by the method of 
estimation. Now a, b, c, . . . are in turn determined only bv the values 
of V'p*, ... If = poVp*, ^ * 70 V/*. • ■ • "'e have the 

maximum likelihood case. It follows that if we can choose the function 
in any way subject to the conditions •/'/>* — po*/p*, V'q* = • • • 

we shall obtain an estimate which is just as good as the maximum likeli¬ 
hood estimate, and which accordingly will be best possible. Such an 
estimate is called “fully efficient”. M. C. K. 'Pweedic has pointed out 
(Ann. Math. Stat. 24(1953), 498) a very general method of choosing such 
a function. \Vc take any convenient set of functions fxif*). /af'O* 

. . . which have the properties, /i(i) = /^(O =/.i(0 when u = i 
D„/,(w) = D„/2(/0 = ^-^u/a(«) I. We then use as the estimation 

equation 

np, e.. 0 ) = /-o/, Q + (~) ('•0/3 (f) 


+ 


= o 


This is of the general form (21.15), since 

'^{p> g, r, . . . 0 ) = po/iii) I- 70/2(0 f“ ''oA (0 + • 


(21.21) 


= /i(0f/« 7o 'i ''0 


- 4 - 


.] = O. 


P 

P 


when P 


Also, by direct differentiation, where u 

P 

andp both take the true value p* this becomes '/'/-* = />«*//>*, which shows 
that the method is fully cflicietU. It is also very general, since we can 
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choose the functions/,,/g./s ... in a great variety of ways. If we take 
/i(^) = = . . . = w it becomes the method of maximum 

likelihood; if we tie /,(«) — /2(w) =f^(ii) = —it becomes the 
“minimum method of H. Jeffreys and J. Neyman. 

EXAMPLE 

(i) Returning to the linkage problem of Section 21.6, in which the 
observed numbers in the four classes were x = 37, y = 906, z ~ 904 
and w = 1997 with corresponding theoretical proportions p = 
q ~ r = J(i — 0 ), s = J(2 + we use Tweedie’s method of esti¬ 
mation, taking the four functions /i(h)»/2(«)»/3(«)>/i(“) to be all equal 
to — u~^y i.e. we take as our estimation equation 

—poPiP — qoqlQ — Tqt/R — sqs/S = o. 

Since = J, ^5 = = — J, i this becomes 

— i[l 0 n/x ~ l(i — 9 )n/y — i(i — 0 )nlz + ^(2 + d)nlw] = o 
or on taking out the common factor w/i6, 

+ y"* + z~'^ + — (y~^ + z~^ — 2fiJ~*) = o 

i.e. $ = (y-i + — 2W~^)/(x~^ -r 

= (906-1+904-*—2X i997'^);(32-^+906-1+904-1+ 1997“^) 

“ *0356 (compare the maximum likelihood estimate •0357). 

I'he standard error is given by 1/ v//, where / is determined by formula 
(21.14); it is -00583, exactly as for the maximum likelihood estimate. 

A'ote .—The theory explained above applies strictly speaking only to 
large samples. But it is found in practice that maximum likelihood gives 
reliable results, even in small samples. 

PROBLEMS 

(1) In the example considered above, estimate 6 by taking the 
functions/,(w),/«(?/),/a(w),/,(«) to be all equal to In «. 

(2) Race, Sanger, Lawler, and Keetch (Anfi. Eugen. Lond.y 14 
(1948), 134-138) tested the blood-groups of 172 Latvians, and found 78 
to be MMy 75 AIN, and 19 AiV. 'I'he theoretical proportions are 6 ~ : 
■zd{i — 0 ) : (i — 6 )-, where 6 is the frequency of the gene Min Latvia. 
Estimate 6 by maximum likelihood, and find its standard error. 

21.9 Estimation of several parameters 

T he process described in the last section can readily be extended to 
tlie case in which the theoretical proportions p, q, r . . . depend on two 
or more unknown quantities or parameters. We shall consider the case 
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• 

in which there are two such unknowns, say 6 and -q ; so that p is then a 
known function of d and rj, and similarly for q, r, etc. Let P, Q, R . . . 
be the actual observed proportions, and n the total number in the sam¬ 
ple. We now choose any set of functions we like, say /i(«), /2(u), /sC^), 
• • •> SiW)> gz{^)y • • •, etc., subject to the conditions that/i(i) = 

f^) — - - • and^i(i) = ^2(1) = ^3(1) = ■ • and that ail these 

functions have derivative equal to i when u = i. For example we can 
take all these functions to be equal to u. We then solve the two simul¬ 
taneous equations 



+ 9, /. (^) 




(21.22) 


The values of 6 and -q thus obtained can be shown to be the most accurate 
estimates possible of the true values $* and »;*, provided that the sample 
is sufficiently large. To find their standard errors, proceed as follows. 
Calculate the values of the quantities 


h\ = + • • •) 

/12 = /g, = n{pQprjlp 4 - qQ q^lq 4 - Tq rr,lr + . . .) 
/ga — n{pr,^lp + qrj~lq + r^^/r + • • •) 

using the estimated values of 0 and q. 'I'he matrix / — 




called the “information matrix”. 


Invert this matrix by the method of 


Section 17.5 (see Section 18.11) to find the matrix /"* = r ^ ^ 

Then the standard error of the estimate of 0 is v that of the 

estimate of q is ^^22* 

If we have three parameters we shall have in a similar way three 
estimation equations, a 3 X 3 information matrix, and a 3 X 3 error 
variance matrix obtained by inverting the information matrix. Tlte 
standard errors will be the square roots of the elements on the diagonal 
of this error variance matrix. 


EX A M IM- E 

(i) There arc certain blood-groups, known as the Lewis groups, 
which have the following properties. A certain serum, “anti-Lewis a", 
will agglutinate the blood of some individuals, while another scrum 
“anti-Lewis b” will agglutinate that of other individuals. 'I’here arc a 
few persons whose blood is agglutinated by Jieither serum, but no 
adults have been found who react to both. 'I hese facts, together with 
certain others which we need not consider here, suggest the following 
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tentative explanation. There are three allelomorphic genes 
with the property that any person having at least one Le^ gene reacts 
with anti-Le*’ serum, while a person with two Le® genes reacts with Le^ 
serum. One of the steps in testing this hypothesis (which is still not 
entirely certain) is to find the frequencies of these genes in the general 
population. Suppose that the Le^ gene has frequency a, and the Le^ 
gene frequency ^; then the frequency of the Le" gene is y = i — a — 
We can therefore construct a table as follows: 


Table 21.5—Frequencies of Lewis blood-groups 


Blood 

reactions 

1 

Possible genetical 
types 

f I 

Theoretical 

frequency 

1 Observed 
number 

Le® positive 
Le^ negative 

Le® Le® 

1 

\ 

/> = a® 

\ 

11 

o^ 

Le® negative 
Le^ positive 

Le® Le**, he^ Le^, 

\ Le^ Le^ 

1 

q — 2 a^H-^=+ 2 y^ 

: 

y = jys 

1 

Le® negative 
Le^ negative 

' Le® Le^, Le‘ Le^ 

I r=2ay+y- 
j =i-a2-2j8+j82 

1 

z = 14 


Total number n = 238. Observed frequencies P = .v/«, Q = yin, R = zin 


Also (from the expressions of p, q, r in terms of a and ^ only) 

pa = 2a,/>3 =^0\ qa = o; = ^(l — ^)\ = —2a.r[i = —2(l — ^). 

Let us take as estimation equation the maximum likelihood equations, 
i.e. equations (20.62) with /i(h) =/2(“) == .^3(*0 = £'i(“) = “ 

g^{u) = u. These equations become 

px^lp "t” q^iQ-q + — o 

P + %QiH + — o 

i.e. 

2[a.v/a" — a«/(l — a" — 2;S + /3")]«“‘ = O 

2 [(l - ^)y (2/3 - ^2) - (l - i3)c/(l - - 28 - ^-)]«-l = O 

Multiply the first equation through by «a-(i — — 2^ + ^-), and 

divide bv 2a; multiply the second equation through by n {2^ — X 
(i — a- — 2^3 -f ^ 3 -) and divide by 2(1 — ^). We get 

a[i — a- — (2jS — /3-)] — a~Z = O 
y[l - a— { 2 ^ - n] - (2j8 - n- = O- 
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These are two linear simultaneous equations for the quantities and 
(2^ — ) 9 “); they can be written 


a%x + 5r) + ( 2 ^ - = .V 

a^y (2^ — (y z) = y 

and on solving them in the usual way wc find 

a- = .v/(.v y + ir) = xjn — P, whence a = \ P = -440 
2.^ — p = yj{x -h y + c) = yjn = Q, 

whence i — 2/3 + ^- = i — O, 

I — ^ = V(i — O) or ^ = 1 — ^'{1 - Q) -498. 

This gives us estimates of the gene frequencies. To find their standard 
errors, calculate 

/i j = n[p„^jp + qj^lq + r^^/r] 

- «[4aVa2 H- O + 4«“'(* - a" - 2 ^ + 

“ 3130. substituting ;/ = 238, a — 440, /3 — -498. 

7,2 = /.^, = q- q^ qrjq r, rp/r] 

= ttfo + o + 4a(i — ^)/(i - a- — 2^ + ^-)] - 3577. 

^22 = 4 - 

=/;[o+ 4 (i-W( 2^-^^') < -- 4400 


Invert the matrix / = 


[ •00450 — •oo366"| 

— 00366 •00320J 

and that of ^ is = 


[3^30 

1 to obtain r 

r«ii 

L3577 

4400J 



'I'hc standard error of a is \ 067, 

■057- 


21.xo Regression 

Fig. 3.1 (p. 33) is a graph obtained in ihc following way. W'e take all 
the fathers in a large sample whose heiglu is .v inches (measured to the 
nearest inch), find the mean height v of their sons, atui plot y against .v. 
The distribution of points so obtained is a little irregular, especially at 
the ends, because of ratulom fluctuations in sampling, but it ap[')rt)xi- 
mates very closely to a straight line. I'urthermore we see from this 
graph that if the father’s height .v is given, tlie average son’s height _v is 
nearer to the general mean height of the population (about 68 inches) 
than the father’s height is; in fact the average son’s heiglit “regresses” 
about half-way towards the mearj. 'Flius fathers of height 72 inches 
produce on the average sons of height 70 inches. For this reason I'rancis 
Gallon called this line a “regressitjn line”. 

In Fig. 3.5 (p. 41) the mean weight y of sclnK)lgirls is plottetl against 
their height X] this time the graph is curved. 1 liis also is known as a 
“regression curve”, although the original sense of the word “regression” 
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is hardly appropriate here. In general, if we have any two correlated 
variables x and y, then the graph of mean values of y for given values 
of X is called the “regression curve for_y on We can also plot the 
mean value of a: for any given value of jy; this will give a different curve, 
known as the “regression of .v on y*\ Thus if we take the distribution of 
degrees of overcrowding a: and infantile mortality y in Table 20.4, we 
can find values of mean infant mortality of each degree of overcrowding, 
and mean degree of overcrowding for each range of values of infant 
mortality. The points thus derived are plotted in Fig. 21.3; they lie on 



21.3—The regression lines of the distribution of Fig. 20.5 

Dhick circles = observed means of y for given x 
white ,, = „ ,, X ,, ,, y 

double lines = theoretically calculated regressions 

regression curves which approximate to straight lines, but which 
are very irregular owing to the smallness of the sample. 

Every distribution of two variables will have regression curves 
dehned in this way. 'I'he question arises as to whether this amounts to a 
merely formal definition, or whether it can be given a fairly concrete 
interpretation: the answer will depend on the particular problem in¬ 
volved. Hut suppose for instance that x is a variable which can be 
measured accurately, such as the time, whereas is subject to appreci¬ 
able errors of measurement or other random fluctuations. It is reason¬ 
able to suppose in most cases that if we repeat the experiment many 
times, the mean of the observed values of 3.' will approximate to the true 
value, as the errors of measurement will tend to cancel out. Thus the 
true relation between .v and y will be obtained by plotting the mean of 
r against .v, i.e. by taking the regression of y on x. This will not be true 
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if X is also subject to error, but the problem then involves severe mathe¬ 
matical difficulties which have only partially been overcome. 

Let us suppose then that we can write for any particular value of y 

y = a + bx € . . . (21.23) 

where a bx is the value of y for a point lying exactly on the regression 
line, and e denotes an extra error of measurement or random fluctuation 
which is added toj>’, and which on the average is zero. If a relation such 
as (21.23) holds, the regression curv^e is accordingly straight, having the 
equation y = a bx. Now we might find the line by plotting the 
observed means, as we have done above, and fitting a straight line by 
eye. But this would not be very satisfactory, and the following method 
is preferable. 

First let us sum all the equations (21.23) observed values 

X and y, i.e. 

iTy = an + b2^x -f- 27 c 

where n is the number of pairs of values ( v, ;\') in the sample. Dividing 
through by n this gives 

y = a bx -\- "e 

But by hypothesis the error e is zero on the average. We shall therefore 
not be far wrong in supposing that the sample average € is near enough 
to zero to be neglected, i.e. that 

y — a bx . . . (21.24) 

This means that the regression line must go through the mean point 
{x, y). Furthermore let us multiply both sides of (21.23) again 

sum over all observed pairs of values of .v and y. We obtain 

2 ^xy = aZx + AT.v- + Tear . . (21.25) 

Again wc shall not be far wrong in taking Te-v to be negligible. Multiply 
(21.24) subtract from (21.25); have 

£xy — _yi 7 .v = b(I^x~ — ^i 7 a:) 

But 2^xy — y£x is by definition the codeviance or sum of products of 
deviations of x and y, and was denoted in Section 20.12 by the symbol 
S^y. Similarly — x£x is the deviance of ar. Thus = bS^^ or 

b = S^JS^^ . . . (21.26) 

Since S^y = (« — = {n — i)?Vx, this can also be written 

b = ^'rvl^xK • • ‘ (^t.2/) 

b is known as the “regression coefficient of y on .v“, and is accordingly 
the quotient of the covariance divided by the variance of x. From 
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(21.24) a = y — bx and therefore the equation of the regression line can 
be written 

y — a bx 

— (y — bx) + bx 

or y—y — b (x — 5 ) . . . (21.28) 


For the distribution of Table 20.4 vve have already calculated in Sec¬ 
tion 20.14 the values x = 17 80, y = 125*19, = 89 0, = 130, 

Vyy = 427, whence b = Vxvivxx = 1*46. The equation of the calculated 
regression line of y on x is therefore (y — 125*2) = i*46(a; — 17*80). 
This is easy to draw; it is enough to draw the line of slope b = 1*46 
through the mean point {x^ ^); or alternatively we can find one other 
point (.v, y) satisfying the regression line equation, and join it to (x, y) 
by a ruler. The regression line of x on y will similarly be (.v — x) ~ 
b'(y — y), where b' = Vxvl'^'vv = ' 304 * These two regression lines are 
shown in Fig. 21.3 for comparison with the crude estimates obtained by 
the simple plotting of means. 

The regression coefficient b can be interpreted as the average increase 
in y caused by unit increase in .v; thus the assertion that the regression 
coefficient of son’s height on father’s height is means that an increase 
of I inch in the father’s height increases that of the son by ^ inch, on the 
average. Of course the value b ~ is strictly only an estimate of 

the true value fi = v^y:vj.x of the coefficient. The standard error of b is 


approximately 

“ — 2) 


More precisely, if is the signifi¬ 


cant point for “Student’s” i at (say) the -05 level of significance for = 
{n — 2) degrees of freedom (Appendix Table 7) then the sample co¬ 
efficient b can be expected to differ from the true regression by not 
more than ^o^b- 

More complicated regressions, such as y = a bx + can be 
fitted in a similar way. Such an equation implies that any particular 
value of f is of the lorm y - a 4 - ^.v r.v* + e, where e is a random 
error, independent of .v, and ot zero mean. By summing over all pairs 
of observed values of .v and v we shall therefore find 


= nil + b~x + c^x- -f Ze 
= flA-.v -I 6r.v-’ + + re.v 

Sx-y = aSx- -f bSx^ + rT.v* -f- rc.v-. 


'I'he sums T.y. X.y-, T.y=*, Sx\ A*v, -.yv. 2’.v-y are all calculable from the 
data, though with a considerable amount of labour. The errors \\dll 
tend to cancel; so if we assume that the sums 2'’€, Hex and 2 ^ex- are 
negligible we shall have three equations to determine the three unknown 
coefficients a, b, c. Furthermore this method can in certain cases be 
related to maximum likelihood, and thereby shown to be the most 
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accurate method of estimating h, and r. However in general it is very 
laborious. When the observed values of x are equally spaced, there are 
tables in Fisher and Yates’s Tables for Biological, Medical and Agri¬ 
cultural Research which greatly reduce the labour. (See also P. G. 
Guest, The fitting of polynomials by the method of weighted grouping, 
Ann. Math. Statist., 22 (1951), 537-548.) But in general it is best to 
reduce the regression to a straight line by using a transformation in the 
manner suggested in Chapter 7. 

21.II Discriminant functions 

It is usual to distinguish between two species or two groups of indi- 
yiduals by using a whole set of characters, rather than a single one on its 
own. The reason is that if we consider only a single character we may 
find that there is a certain amount of overlapping between the species, 
so that in intermediate cases it may be uncertain to which species any 
given individual belongs; but if a number of different characters are 
taken into consideration this uiiccrtainty may he considerably reduced. 

There may be some difficulty, however, in deciding on the relative 
importance of the various characters. If an individual agrees in character 
X with species A, and in character y with species B, which one is it 
better to assign the individual to? Such difficulties can he overcome in 
the following way: we find a suitable function L{x, y, xr) of all the char¬ 
acters .V, y, z\ and we base our classification on the single function 
L instead of on the values of the character considered separately. 
Such a function B is known as a “discriminant” or “tliscriminator”. 
By choosing it correctly we can not only considerably simplify the prob¬ 
lem of classification, but also reduce the number of errors of classifi¬ 
cation to a minimum. 

i.et us suppose that in the population we are studying there is a 
proportion a of individuals of class A, and a proportion b = \ --a of 
class B. 'rhese classes may be species, se.xes, or any other groups into 
which it is convenient to divide the population. Suppose further that 
we make certain measurements x, y, z, . . . on each individual. (Such 
measurements will generally be numerical; but the theory w ill be eijually 
appropriate if they denote merely, say, the presence or absence of a cer¬ 
tain feature.) lyCt the j^robability tlnui an individual of class A has 
measurements x, y, z be p'^,uty i**'d the probability for class B be/>^'j.i,. 
Then by the product rule for probabilities there will be a proportion 
op'*TV, of individuals of class A with measurements v, y, or in the general 
population, and a similar proportitjii bp^'rui of class B with these meas¬ 
urements; the total proportion will be ap^^gy. ■\ 


9 

there will be a probability , 

he is of class A, and probability xv, ~ hp‘\ 
he is of class B (sec Section 19.8). Clearly ^ 

'lo illustrate: in 1947 II. Kalmus made a study of 521 mothers who 


l>P*'^vz- 

If then w 

e pick 

incuts to 

be .V, V, 

and O', 



.) that 

■ tH.oP''^ X V t 

\ bp‘^^y 
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uz I - 

■ 
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had been suffering from haemorrhage before childbirth. Of these it 
was found that 32 had the central type of placenta praevia, in which the 
delivery of the child may be hindered by the placenta (class A), while 
the remaining 489 had other causes of antepartum haemorrhage (class 
B) (Ann. Etigen. Lond.y 13, 283-290). Furthermore he found that out 
of the 31 mothers aged between 30 and 34 with no previous children, 
4 suffered from central placenta praevia, and 27 from other causes. 
Ignoring for the present any errors due to the smallness of the numbers 
involved, we deduce that of mothers aged 30 to 34 with no previous 
children, and suffering from antepartum haemorrhage, a proportion 
4/(4 + 27) = 4/31 = *12 will have central placenta praevia. But we 
can also obtain this proportion as follows: a, the proportion of central 
placentas (class A) in all mothers in the sample is 32/521, while 
the proportion of mothers of given age-group x (i.e. 30 to 34) and num¬ 
ber j (= o) of previous children, as a fraction of all those suffering from 
central placenta praevia, is 4/32. Thus = jVi X 

Similarly b — p^xv — and bp^^v — stt X So 

TT^arv = ^^ P '^ icvK ^ P^xv + ^p^xv) = 5 It/( 54 t -hT%T) = 4/(4 + 27) = 4/31, 

as before. 

Kalmus proceeded in this way to calculate the following table. (But 
the distributions were first smoothed out to counteract the sampling 
fluctuations: for details see the reference quoted.) 


Table ai.6—Probability that a case of antepartum haemorrhage is 

one of central placenta praevia 


No. of previous 


Age of mother 


children 

m 

20-24 

25-29 

30-34 

35-39 

40-44 

45-49 

0 

•02 

•03 

•05 

•09 

■16 

•27 

•49 

I 

•03 

•04 

•06 

•09 

•15 

•22 

•35 

1 

2 ' 

1 

•04 

•06 

•08 

•15 

•16 

•21 

3 


•05 ! 

-05 

•07 

•08 

■II , 

•^5 

4 

1 

1 

•05 

•06 

*06 

•07 

•09 

5 

j 


•OS 

■05 

■05 


•06 

6 


■ 

*05 

•04 

•04 

•04 

■04 

7 

8 

■ 


— 

•05 

•04 

■04 

•03 




•04 

•03 

•01 

•00 


Now when this probability high the surgeon may be inclined 

to treat the case as one of central placenta, whereas when 7r^a.„ is low he 
will dismiss the risk. Exactly where he will draw the dividing line may 
depend on his judgment of the relative risks of making an error either 
way: for a wrong judgment will certainly involve some risk, and he must 
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try to take the best possible course of action in the interests of the patient. 
But in making such a judgment he will be using this probability 77-^*^ as a 
discriminant function. If it is near i he will behave as if the individual 
belonged to class A, and if small, to class B. 

On dividing numerator and denominator by/>'^*v we obtain 

-^xv = H- 

= al[a bp^xvlP'^xv] . • . . (21.29) 


We can see from this formula that if the ratio p^^vlp"^xy is large, the 
denominator on the right-hand side of the equation will be large, and 
’fr^xy will be nearly zero, i.e. the indications are in favour of class B. On 
the other hand, if the ratio is small, then xy is nearly i, and the selected 
individual is probably of class A. This is important, because quite often 
we shall not know the values of a and the respective proportions of 
groups A and B in the whole population: and so we cannot calculate 
the accurate value of But we may nevertheless know the values of 

p^XV and p'^xvi and can calculate the ratio p^xy p'^xy This ratio can still 
be used as a discriminant function, since large values will tend to indi¬ 
cate class B, and small values class A. 'Fhis indeed is no more than 
common sense. If p^xy much larger than P'^xyy it follows that indi¬ 
viduals with the particular measured characters .v and y occur much 
more often in group B than in group A: and therefore if we come 
across such an individual at random we shall be much more inclined to 
think that he belongs to class B than to class A—altliougli if we do not 
know the values of a and b we shall not be able to express this preference 
as an exact probability. However it is customary and convenient to use 
instead of the ratio p^xylp'^xy its natural logarithm A = In {p^xv'P'^xy) 
= \np^xv — In p'^xv 'I’his simplifies certain calculations, and also has 
a certain symmetry as between classes A and B which is lacking in the 
ratio p^xvlp^XV There is no change in principle involved, since if we 
know the value of A we can deduce that of p^ xylp'^ xv~ e-'hand conversely 
if we know the ratio we can deduce the value of A. 'I’lie use of the one 
as a discriminant is completely equivalent to the use of the other. We 
have of course to remember that since large values of the ratio p'^xy'p'^xu 
indicate an individual of class B, and small values one of class A, in 
terms of the logarithm A this means that large positive values of A 
indicate B, and large negative values class A. \Ve might call A the 
“ideal discriminant function”. In terms of it equation (21.29) becomes 


■rr^xv = «/[" h 


(21.30) 


To illustrate: suppose we wish to decide whether an observed segre¬ 
gation of X individuals of one ty|>e toy of another is really a i : i ratio 
(class A) or a 3 : I ratio (class B). If it is a i : i ratio the probability 
of obtaining the observed sample is [.v T >'(I)'^( v'ly: if it is a 
3 ‘ I ratio the probability is p^^xy = 1’^ T y (4)^( I herefore 
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A = In — In 

^ In (| .V+>> /[y[3>) -\-x In ^-\-y\n |—In ( | .y -\-yl |.v \y) —x In j In | 

= A;(In f—In +jv(ln J— In 
= -405^: — -693^ 

Thus if we found an actual segregation x = 20,^ = 10, we should find 
A = S-10 — 6*93 = I'17, which is positive and accordingly suggests a 
3 : I rather than a i : i ratio. If we had x = 25,3; = 10 we would find 
A — 3-20, and so high a value of A would be fairly strong evidence for 
class B, i.e. a 3 : i ratio. 

Again we may often wish to distinguish between two groups A and 
B on the basis of two (or more) continuously variable characters x and y 
(and jy. . , .). Let us consider first the case of two characters, and let us 
suppose they are normally distributed in each class. We cannot then 
strictly speak of the ratio of probabilities of characters 

taking the values .v, r, exactly, since these probabilities are zero. But 
we can consider the probability that .v will lie in a certain small range, 
between Xi and .x'l -h S.Vj, and thaty at the same time will lie between 
y^ andy, + Sy; this probability is very nearly equal to <^^(A:i,yi) Bx Sy, 
where ^^^(.ri, yi) stands for the probability density function for group A. 
The same probability for group B will be <^^(.v„y,) 5 a: 8 y; and if we 
define A as the difference in logarithms of these probabilities, we shall 
have 

yl = In [<^V‘i,y,) Sa: Sy] — In [<^'^(.Vi, y^) Sa: Sy] 

= In <^%Vi,yi) — In «^'^(.Vi,yi) . . . (21.31) 


since the logarithms of Sa: Sy will cancel out. This is a perfectly general 
formula. However if the distributions are normal the formula for 
^'^(.v, v) is (Section 20.15) 

<f>‘^(x,y) = (ztt)-* (aj^)-i where 


^ - V V yy 


- 






zx 




A _ 


zy 


= —V^^yjoj^, 


t tv 


and/t^y, as usual the means, variances,and co- 

variance of .V and y in class A. There will be a similar formula for 
_v), and from them we can calculate A by equation (21.31): this 

gives 

A = Q'^ — i In {co^jco^) . . (21.32) 


Now in practice we shall not know the exact values of the means and 
variances of .v and v, but only estimates derived from the sample. Then 
we shall use these estimates to derive an estimate of A, which we can call 
L, and from that an estimate of the probability by using equation 
(21.30) with L instead of A. Thus if fn^, ffiy are the observed means of 
a: and 3’ respectively in a sample of class A individuals, and if ^'vy> 
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Vg.y are the corresponding variances and covariance, then we shall 
calculate 


^ ^ XX ^ V V ^ xy 


^XX = VyJW, tyv = V:,JtV, 




Instead of which is the true but unknown value, we shall use v.\ and 
instead of the expression 

i[^'xx(x ~ + 2ixv{x — — "'v) + ^yy{y ~ 

Similarly if M^, My, V^xi ^vv> ^xy ^re the means, variances and 
covariance for the sample of class B, we shall calculate the correspond¬ 
ing values W = = VyyjlV, lyy = V^^ilV, I^y = 

— V^yfiV, and make the appropriate substitutions for O® and oj^. 
Proceeding in this way we finally obtain the discriminant 


+ 2a,y.YV + a 

where — / 


yy^ 




2a^V -f 2a„_V + «] 


(21.33) 


a^.. = — / 


X V 


^ V V - ^ u V 


XX 
X y 




1 y yf 

= / 


xy y ^xx X ^xy y 


= Ixy A/x ^ / 


y y 


My - try f^^x — ' 


a — t 


XX 


4 - 27 


xy tftx fHy 


4 - / 


y y 


mY — /xx -lfx" — 2 /^^ 4/^4/ 

lyy My- lU {tV W) 


’^rhese equations may perhaps he best illustrated by an example. 
M. N. Karn and L. S. Penros' e tnadc an iuNcsti^ation ol the number of 
babies surviving one inonlli after birth in relation to their gestation 
time (.V days) and weight at birth (y lb). Let us c.ill the sur\lvors 
“class A” and the non-survivors “class H ". 'Then from a sample of 
6693 female babies they found the following v.ihies (./////. L'ui'c/i. Loud., 

(* 950 . i 47 -*f> 4 )— 


Table 21.7—Constants for distributions of Kestation time and 

birth weight in fem.ile babies 


Distribution 

constant 

1 (<•!) Survivors 

Hi) 

Non-survivors 

Proportions 

a 

•959 > 

h 

■ 0409 

Mean of a* 

ttlr - 

- 281-5 

Mx 

2594 

Mean of y 

Uly 

7 '32 

My 

5-284 

Variance of a 


151 y 

I'xx 

1017 

Covariance 

^’xy 

5-1 16 

, l\.y 

5246 

Variance of y 

7* 

*•' 1 / li 

I- 2 IO ' 

1 l\u 

4-830 

w w 

w 

^ 157^3 

IV 

- 2160 • I 


i XX 

•007676 

i XX 

•002236 


'xy 

-03246 

/xy 

— 02429 


t y y 

^ - 9^37 

^ y y 

■4708 
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From these values by using the formulas already given we find the dis¬ 
criminant function to be 

L = ' ooz - yzox ^—008 i^xy -1—24645^®—i •4776X—i • 5487^+213*655 

This could be used as it stands, but is rather more convenient if simpli¬ 
fied as follows. The terms containing x are *002720*^ — *008170?^ 
— i*4776a; = *002720 [x^ — 3*00367^^ — 543*^35^:]. We now “com¬ 
plete the square” in much the same way as for a quadratic equation: we 
know that (since ^ X 3*00367 = 1*50184 and i X 543*235 = 271*618) 

(a; — 1*50184^ — 271*618)- = — 3*oo367A;y — 543*235^ + 

(i*50i84y)2 + (271*618)2 — 2 X i*5oi84y X 271*618 

or 

*002720 (x — 1*50184^ — 271*618)2 = *002700^;® — *00817^^ 

— i*4776a; + *00614^2 -}- 2*2191 ly + 200*672 

So on comparing this with the original expression for L we find 
L == *002720[jc — 1*50184^ — 27 I* 6 i 8]2 + *2403^2 _ 3-7678^ + 12*983. 

We now repeat the process, by completing the square for y, and finally 
obtain 

L = *oo272o[a: — 1*50184^ — 27i*6i 8]2 + •2403[y — 7*839]* — 1*784 

From this the value of L can be found for any given gestation time .v 
and weighty, and from that the chance of survival from equation (21.30). 
In particular since the first two terms in L are squares, and therefore 
are positive or zero, we see that the least possible value of L is —1*784, 
and this occurs when x — i*5oi84y — 271*618 = o, y — 7*839 = o, 
i.e. at weight y = 7*84 lb and gestation time x — 283*4 days. This 
corresponds to the greatest possible chance of survival = 

aj{a + be~^) — *959i/[*959i+0409^“^’®'*]= *993. 

This can also be presented graphically by drawing the curves in the 
(x, y) plane for which L is constant. These “isocritic-lines” will be 
curves on which the probability of survival is constant. However it is 

convenient to use as co-ordinates X = +*002720 (.x* — 1*5023' — 271*62) 
and Y — +*2403 (y — 7*839) instead of x and y, for then L — 

+ ys — 1*784, and the curx'e on which L is constant will be of the 
form A’'2 + Y- = L 1*784 == (say) i.e. a circle xvith centre at the 
origin and radius R. In the plane with co-ordinates A, F, we can mark 
a grid of lines showing the corresponding values of the original measure¬ 
ments X and y, thereby obtaining the diagram shown in Fig. 21.4 from 
which the probability of surs’ival can be estimated at once. It is of 
interest to note that this diagram is in very good agreement with the 
probabilities calculated by the relatively crude method of dividing the 
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Fig. 21.4—The calculated chances of survival of female babies for 

different birth weights and gestation times 
(From Ann. Euuen. Land., 16 (1951), p. by pernns.sion of the ICditorJ 


number of survivors observed by the total number of any given gestation 
time and weight, even though in deriving the theoretical probability 
we have assumed that the distributions were normal, which is far from 
being the case for the non-survivors. 

These formulas can be readily extended to any number of characters. 
Suppose there arc two samples, one composed of itulividuals of group 
A, the other of individuals of group li; and on each individual h 
nieasurements x, y, z . . . are made. Let be the mean of measurc- 
X in group A, its variance, and the covariance of .v and y. 
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^xx ^zv ^xz • 

Let V denote the covariance matrix I I ^ 

* * * 

I ^xx ^xv ^xz • • • 

the determinant of this matrix, and i = I ^yz • • • 


its inverse. Let the corresponding quantities for group B be denoted 

by capital letters: MVxx-, ^xy* Then the appropriate discriminant 

function is 

L = - ?«x)“ - \Ixx{x - 4 - ¥yy(y - ^riyY - llyy{y - Myf 

-h etc. + ixy{x — {y — ;«„) — Ixv(x — Mx) (y — ^y) 
+ ixz (x — Wx ) ~ »U) — ixz{x — M^) (s — M*) + etc. 

-f- \h In (wIlV) .(21.34) 

If we write x for the vector [.v, y,z, . . .], m for the vector [w*, niy, tn^, ..] 
and M for [Mx, M„, . . .], this can be compactly written in matrix 

notation 

L = i{x - m)i(x - m)' - i{x-M)I{x-My + Vi In (wjlV) 

But this amounts to no more than a shorthand way of writing equation 
(21.34). 

21.12 Simplified discriminants 

The formula given above for the best discriminant L is rather 
troublesome to calculate, as it involves the inversion of the two variance 
matrices v and V. It is also in general rather cumbersome in use. It is 
therefore worth while to find an approximate formula which is easier to 
handle and almost as good in performing the discrimination. 

When the two co\ ariance matrices are equal, v = Vy our formula 
simplifies considerablv, since then 1 = = /, ix* = ^xx> 

= Ixvy etc., and (21.34) becomes 

• L — l^x + + - • • + ^ 

where 

L = Lx (^-fx — "L) + ixv {^'^y — wl) + ixz (-^x — wl) + . . . 

/„ = ixy (.^l^x ~ "L) 4 - iyy (^/y — ^y) 4 " lyj (M ^ — ?«-) 4 " • - • ^tC. 
and C does not involve .v, 3', 5r. . . . . . ( 21 - 35 ) 

Thus in this case we can use Fisher’s linear function l^x + /y V + + 

.... as a discriminant; and there is only one matrix to be inverted. In 
practice the two covariance matrices are rarely equal, but we can take 
the average of the two, = i(v 4- F) an assumed “common covariance 
matrix”, invert it to obtain a matrix and use in place of f in (21.35)* 
[In matrix notation L ~ {M — m)i^x'.] This usually gives a very 
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satisfactory discrimination, although the function can no longer be 
related to a probability as in (21.30). If there are only two characters 
X and y this leads to a particularly simple function (on multiplication 
by w^) 


(21.36) 


where 


= h['^'xx + ^xx]l = ^(“^XU + ^^xv)> 


.t = 

XX 


V 


^vv ~ hi^VU “1“ ^vv)‘ 

Thus for the case of gestation time and birth weight considered in the 
preceding section, A/j. — tftj. = —22*1, A/y — = 1-848, v^xx= 5 ^ 4 ' 

v^xv — 28-79, “ 3*020, and so we obtain the function —iS'SA-'-* 

— 443y; the more negative this is, the greater the chance of survival. 
This is true to a first approximation. But the more accurate quadratic 
discriminant calculated in the preceding section shows clearly that there 
is an optimal point, beyond which further increases in gestation time 
or birth weight impair the chances of survival: this is not shown by the 
simplified linear form. 

Another example given by Fisher {Ann. Eugen. Land., 7 (^ 937 )’ 
179-188) relates to the discrimination of two species of iris, Ins setosa 
and Iris versicolor. Fifty Bowers of each species were taken, and tour 
measurements made on each: .v, the sepal length; y, the sepal width, 
.K, the petal length ; and w, the petal width. The means of these measure¬ 
ments w'cre as follows (in cm)— 

In class A, 7 . setosa: 

Wj — I 46, 


w... s= -2 


m 


.= I 08. 


m* = 5 01 , w/y = 3*43, w* — » 4'^ 

In class B, 7 . versicolor: 

= 5-94, A/y = 2 77, A/j -- 4-26, . 47 ,., = 1-33 

whence 

A 7 ^-w^=- 93 , A/y-Wy=—66, A/,- m,=2-8o. A 7 , 

Furthermore Fisher found the following matrix of common vari¬ 
ances and covariances (averaged for the two species). 

•0996 

•0472 


.t _ 


*953 

•0922 

•0996 

•033* 


•0922 
•1211 
■0472 
•0253 


*255 

•0396 


•033* 

■0253 

•0396 
•025 I 


For example, in 7 . setosa the variance of a-, the sepal length was -1241 
cm^, and in I. versicolor it was *2664 cm-. I he mean of these is t xx 
~‘* 953 > left-hand element of the matrix Similarly the co- 

variance of .v and y was t’,y = -0992 cm-'in re/orrt, and Fjy = 08520111 


uu 
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in versicolor, with mean = *0922. The inverse of this matrix v 
can be found to be 



-799 

3-28 

21-49 

-2668 


3-86 
—10-89 
—26-68 
87-81 


Multiplication of the vector (A/ — m) into this matrix, by formula 
(20.75), gives us the vector 

/= {M — m)i^ = [—3-1. —18-1, 21-7, 30-9] 


i.e. the appropriate linear discriminant function is 

L = —2-ix — iS iy - 1 - 21-yz 4 - 30 - 9 ^. 

This can be still further simplified by division by 3: it then becomes 

Lj = —.V — by + 7ar -h ^ow 2; L/3 


a formula which is sufficiently accurate for all practical purposes. By 
the use of /.j we can separate the two species completely. The valu^ of 
Lj for the setosa plants cluster around a mean value 2; —13, and those 

for the versicolor about a mean 2: 21, 

This method still involves the inversion of a matrix, which can be 
very laborious if there are many characters. L. S. Penrose has sug¬ 
gested a further simplification, which effectively reduces the discrimin¬ 
ation to one involving two characters only, without any great loss in 
efficiency. These two characters are defined as 

T = dr xjs'^x rt: • 




.V -h 




+ ■ . . ( 21 - 37 ) 


the same sign being chosen for the term in T as for the corresponding 
term in We can then find the means and variances of T and U in 
the two populations, either by calculating the values of T and U for each 
individual separately, or else by using formula (20.34) for the mean and 
variance of a weighted combination. We can also similarly find the co- 
variance of T and U. It is then an easy matter to calculate, by formula 
(21.36), the best combination of T and U, and that in turn is equivalent 
to a weighted combination of the original measurements. 

For example, from the data we have already given for the two species 

of Iris we obtain 

T = 2-3.V — 2-9y + zSz + 6-yzo 
U = 4-8x — 5 - 5 y + 22-3^ + 43-ottJ 
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My—mr=2-3(M*— ma:)—2-9(M„—//i„)+2-8(ikfj— w*)+6*3(M^— 

= 187 

My—mc7=4-8(M3.—OT*)—5-5(My—mJ-l-22-3(Mj -ot.)4-43o(M^.— m„) 

= 117.0 

Average variances and covariances: 

Zjtyy=4 7S2, e;Vc/= 29-52, v^uu=^ 99 S° 

'^^uui^T — '^r) — — ^v) ~ 277 

•o^rriMu — "*£/) — v^rui^'^T — ^r) = 4‘0 

Thus the best linear function of T and U is 

= 2777’ + 40C/ 

= 656.V — 826J + 8652: -f* i9i7«t; 

By dividing this through by, say, 140, we obtain the function 4 7.V— 5-9jv 

+ 6-22r -f- i3-7et», as compared with = —.v — 6y -f- 72" + ob¬ 

tained by the matrix inversion method. Although these two functions 
are not identical, there is clearlv a resemblance between them, and the 

w ^ 

second form involves considerably less labour in its calculation. 



CHAPTER TWENTY-TWO 


COLSON NOTATION 

ARITHMETIC MADE EASY 


22.1 The development of arithmetical notation 

Nowadays we rarely use Roman numerals, except with the names 
of kings, queens or dynasties, or to mark dates on monuments; or, 
again, to supplement Arabic numeials, e.g. in distinguishing the main 
parts or divisions of classified material. Certainly they are never used in 
calculation. Nobody would struggle with a sum like XLVI + CXXIII 
— LIV, when it is so much more easily written as 46 + 123 — 54 * 
And who would like to do multiplication in Roman numerals, CLXIX 
X MCMXVI? As for fractions, decimals, the solution of equations, 
and all such computations—the mere thought is sufficiently terrifying. 


What would 


XI 

vn 


XXIX 

XII 


IV 

XXI 


work out to be? 


One’s first and 


most natural reaction is to turn it into Arabic numerals, and that as 
quickly as possible. It is indeed said that a merchant wished to send 
his son to a university, at a time when Roman numerals were still in use, 
and asked the professor what mathematics were taught. The professor 
replied, “Our course in arithmetic extends as far as addition and sub¬ 
traction. Rut if your son wishes to learn multiplication and division he 
will have to go to Rome”. I'ero o ben trovato. 

Because Arabic notation is so convenient and efficient we are apt to 
think that it is almost perfect—if indeed we think about the matter at 
all—and that any attempts at improvement, if indeed they were at all 
possible, would not be worth the trouble. However, that is very far 
from the truth. In 1726 J. Colson, an early Fellow of the Royal Society, 
discovered a very simple ai^d neat device by which almost all arith¬ 
metical calculations can be considerably simplified. This device has 
indeed been very much neglected: but that is not at all a good reason 
for continuing to ignore it. Colson himself called it “negativo-affirma- 
tive arithmetick”; but it seems more appropriate to call it “Colson 
Notation” in honour of its first discoverer. The sections which follow 
are devoted to an explanation of its nature, its uses, and its advantages. 


22.2 Negative digits 

In order to understand what Colson achieved let us return for a 
moment to the Roman system. In it not only is the number eleven 
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written as XI, i.e. as ten plus one, but also the number nine is written 
with the same symbols as IX, one before ten. Here the inversion of the 
order shows that the I has to be subtracted from the X. In the same 
way IV stands for four, and XL for forty. This sort of anticipation is 
common enough in everyday speech; in telling the time, for example, 
one will speak of “three minutes to four”, although that will be written 
as “3-57”, for there is no way of writing “three minutes to four” in 
ordinary Arabic figures. When Arabic numerals replaced Roman ones 
this device of anticipation was lost; what Colson did was to put it back 
again, and to put it back in an improved form. 

We ordinarily write twelve as 12, meaning 10 + 2. Suppose we 
think of eight as 2 less than 10; we might write this as iz*, the inverted 
figure 2 meaning that it is counted backwards, i.e. subtracted. Similarly 
seven can be looked on as 3 less than 10, and written if. Thirty-seven 
is 3 less than 40, or 4e; eighty is 20 less than 100. or iro, and ninety- 
eight is 2 less than 100, or loz. In this notation we can write “three 
minutes to four” as 4.e, and “twenty minutes to three” as 3.20. 

‘‘But what is the advantage of this? * the reader w'ill ask. Perhaps 
the following addition sum will suggest the answer; 8 + 12 = 20. If 
we write this in Colson form it becomes 

iz 

12 


20 


When we add the figures in the right-hand column (as in any ordinary 
addition sum) we see that the z cancels the 2, since the first is a backward 
or subtractive 2, and so the right-hand figure of the total is accordingly o 
with nothing to carry. 'I he symbol r is in fact merely another way of 
writing -2. This sort of cancellation occurs very frequently m addi¬ 
tion; for example suppose we wanted to add seven, twenty-four, eighty- 
three, and twenty-six. In ordinary Arabic notation this would be 

written 


7 

24 

83 

26 


140 


We say of course “7 T 4 ^ 3 + = 20”, writing o and carrying 

• Colson himself wrote 12. but the inverted f.Kure seems ... be more legible and 
tore ele^nt though admittedly unfamiliar. And a bar over a b^urc can be readily 
i'crlool<?d iri readiiw or writinK, with possibly disastrous tonse.jui nccs. 
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2; “2 + 2 + 8 + 2 = 14’* writing 4 and carrying the i to the next 
place. But in Colson form it would be written as follows: 

le 

24 

3 ^ 


140 

In the right-hand column the C cancels with the 3, and the 4 with the h, 
so one can see at a glance that the total is o. In the next column the 2 
cancels with the z, so it is only necessary to add 14-3 = 4. In the 
left-hand column it is only necessary to bring down the i. Notice that 
in Colson form there is no carry over in any of the columns—another 
simplification. Of course not all additions will be quite so much simpli¬ 
fied, but as a rule there will be considerable cancellation, and little or 
nothing to carry. In fact in a long addition sum the carry-over can be 
expected to be reduced to about one-tenth of what it would be in ordin¬ 
ary Arabic numerals. 

We shall accordingly introduce five new figures, \, 2, £, t, and S, 
standing for —1, —2, —3, —4, and —5 respectively. But as a compen¬ 
sation for the introduction of these five new digits we can now dis¬ 
pense with 6, 7, 8, and 9 if we so wish, so that the total number of digits 
will only be increased by one. For as we have seen 28 will now be written 
as 32, 47 as 5^, 80 as 120, 83 as 123, and 98 as 102. A number like 68 
seems at first to be a little more troublesome, but it is 70 — 2 ~ lOO — 
30 — 2, and therefore will be written as if2. Similarly 187 = 190 — 3 
-- 2\£. (If any difficulty is experienced in conversion to or from Col¬ 
son notation, start from the right-hand figures and work to the left. 
Thus 187 ends in 7 if; write f, carry i- 14-8 = 9—11, write t, 
carry i. 14-1 = 2. Thus the number, reading again from left to right, 
is 2[f.) 

PROBLEMS 

(1) Write the following numbers in Colson notation: 6, 26, 46, 346, 
27, 1706, 809, 56, 67, 19178. 

(2) Write the following in Arabic numerals: 2h, 52, if, ifi, 2^3, 
if2. 3th, 3002. 

22.3 Negative numbers 

In ordinan,' arithmetic there is no way of writing a negative number 
except by writing the corresponding positive number and prefixing a 

minus sign: e.g. —234. This is perhaps comparable to the custom of 

referring to a woman bv her husband’s name, and prefixing “Mrs.** But, 
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quite apart from such a comparison, this custom does tend to put 
negative numbers in a rather inferior position, and to make them awk¬ 
ward to handle. 

In Colson notation the situation is ver>' different. The numbers 
— I, _2, —3, —4, and —5 are already represented by the single digits 
If z, £, ff and £ respectively. It follows that the symbol zo will represent 
z tens, i.e. —20, and in the same way to will represent 40, too = 
—300, and so on. A symbol like £z will mean t tens plus z, i.e. — 32. 
In fact in order to change the sign of a number we merely ha\e to invert 
every figure in it: —12 = iz, —23 = z£, 18 2Z c2, ~i42 

= xtz and so on. So negative numbers are just as easy to write as 

positive ones. 


PROBLEMS 


if possible satisfy the conditions that they should be 
‘member and vet quite distinctive and not liable to con- 
'ordinarv positive digits. The following suggestions are 


(1) Write the following numbers in Colson notation: —144. 

—28, —13, —368. 

(2) Turn back into Arabic form: zz, £4, zif^ Z2. 

If Colson notation is to be used with ease and confidence it is essen¬ 
tial to have suitable names for these special digits, I. ^ 

names should 
short, easy to remember 

fusion with the ordinary positiv - ,-,1 u 

made with some diffidence, but it is not easy to find a better altcrnam e. 
I is negative unity: it may therefore be cal ed m’g. z is doubh '<=■ 

and therefore is named </ou/>-. (. being tn|dy negative, is Inp. Sin ilarl> 
the quadruply negative h is ijutitl, and S is . n> ot ter iiiiin ter i 

read out figure by figure; ir 3 1 is one ilmib three pmiil iicg. 

This ability m write iiegalive ntnnbers as well as positive ones 
brings a great simplification into many calcn .ttioiw ^ 

means that we can dispense with minus signs .1 toge ler 1 c ' 

Now even the most expert and celebrated tnathent.nicians ° 

make mistakes in algebra through getting a sign "'’"'’e- * '"'Y 

indeed be some comfort to beginners who are ' 

But the important thing is .0 get the right answer: m.d > ^ 'Y" 

notation is a great help, for it enables one to write a 2, for in.ta , 
as * + a and ax - 3 as a.v 3- £, and so turn every sign into a plus. 

This considerably reduces the likelihood of making an error in sign, 
once the notation has been properly mastered. 

Note.—We have used above the symbol £ for qum, “ 5 - 1 is 

fairly convenient for printing or typewriting. Hut 

make such a symbol in ordinary liandwriting, espec • X ovmbol 

quickly. It may therefore be better to write a live thus, -t; , iliat X ^ 
2 an b^ readily and distinctively inverted as Another possibility 
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would be to use the letter v for five, as in Roman numerals, and its 
inverse a for —5. 

22.4 Addition 

We now explain how to perform the four elementary operations of 
addition, subtraction, multiplication, and division in Colson notation. 
These will indeed be done by processes similar to those used in ordinary 
elementary arithmetic. But it will be necessary to remember when 
adding (for example) 2 and 4 to write the total six as ih, and not as 6; 
and in the same way 2 X 4 ^ 12. not 8. So certain parts of the addition 
and multiplication tables must be relearned, although other parts (such 
as 2 + 3 = 5) will remain unchanged. 

The Colson addition table is as follows (Table 22.1): 


Table 22.1—Colson addition table 



S 

1 

b 

e 

z 

1 

0 

I 

2 

3 

4 

5 

s 

10 

II 

12 

13 

14 

s 

b 

£ 

2 

1 

0 

b 

u 

12 

13 

14 

S 

b 

£ 

2 

1 

0 

I 

i 

12 

13 

14 

S 

b 

£ 

2 

1 

0 

1 

2 

z 

13 

14 

S 

b 

i 

2 

I 

0 

I 

2 

3 

I 

14 

S 

b 

£ 

Z 

1 

0 

1 

2 

3 

4 

0 

S 

b 

C 

z 

i 

0 

I 

2 

3 

4 

5 

I 

1 ^ 

e 

Z 

1 

0 

1 

2 

3 

4 

5 

ib 

2 

! f 

z 

I 

0 

I 

2 

3 

4 

5 

ib 

i£ 

3 

1 ^ 

I 

0 

I 

2 

3 

4 

5 

ib 

i£ 

12 

4 

1 

0 

I 

2 

3 

4 

5 

ib 

i£ 

12 

II 

5 

1 

0 

1 

I 

2 

3 

4 

5 

ib 

i£ 

12 

II 

10 


(This is to be read as S -f S = lo, ? 4- b = 11, etc.) By using this 
table any addition sum can be readily performed: e.g. 1023 4 - ziz 4 - 
241:5 4- 55s 4 ' zozi — 2foi. This addition would be written as 
follows: 

I o' z 3 
z^i z 


242 

5 5 
2 0 2 



Z 

£ 


2 £ o 1 

The figures shown in bold type cancel out. Thus, beginning wath the 
right-hand column, we see that 3 cancels with f, and we are left with 
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2 H- 5 H- z. But 2+5 = 3, 3+2=1, and so the last figure of the 
total is I, with no carry. In the next column 2 cancels with z, and 
£+5+2 = 0, again with no carrying figure. The third column gives 
2+4H-5 =i£, so we write £ and carry i; the left-hand column gives 
simply I +1 = 2. 


PROBLEMS 

(1) Find 124 + 222 + i£3K 

(2) Find 14^ + 3£25 + 1214 + 4 ‘^^- 
0 ) Find 1144 4- 3313 + 3225 + 410. 


22.5 Subtraction 


In school, once we have learnt addition we go on to a new and more 
complicated process, that of subtraction; e.g. 424 — 312 = 112. In 
Colson notation there is no new process to learn. For tlie subtraction of 
312 is equivalent to the addition of —3*2, i.e. of £12, and so the sum 
will be written 424 + £[2 - 112. In other words, to subtract a number 
in Colson notation merely change the sign and add. Subtraction as a 
separate process is entirely abolislicd. It is already covered by addition, 
and the four rules of arithmetic are reduced to three. 

This has the further consequence that any number of additions and 
subtractions can be done in one single operation. 1 hus in ortier to 
calculate 1023 — 232 + 2385 -b 548 — 2023 in ordinary arithmetic 
we must first separate out the posiiise and negative nurnliers, and add 
them separately. But in Colson form it becomes 1023 + z^z 2425 
+ 552 + 202£ = 2£oi ; tliis is precisely the sum chosen to illustrate 


Colson addition in Section 22.4. 

It follows that financial accounts \vould be greatly simplifietl. In 
Colson notation there will be no need to put items ot expeiuliiure and 
income into difTerent ctjlumns, will) sonieiinies conipleiely meaning¬ 
less totals. 'I’hey can all be written in a single column, expenses being 
counted as negative. ^I'hc total of this will then be the balance in hand. 


22.6 Multiplication 

The multiplication table in Cfjlson form is also simpliiietl. In tlie 
first place a product by one of the digits i, o, J can be written down at a 
glance. For multiplication by i leaves a number unchanged, e.g. 
4 X 1=4; multiplication bv o abvays gives o; and multiplication by l 
merely changes the sign, c.g.'4 X I - Nearly lialf the multiplication 
table consists of products of this form. Secondly, from a product such 
as 3 X 4 = 12 we can immediately deduce no less than seven others by 
®PP^ymg the rule of signs. 'I'hiis £ x 4 ( -3) ^4 ( 3 ^ 4 ) 

= iz, and in the same way 3 X t" = 12, aiul £ X b - ( -3) ^ ( 4 ) 

= 3 X 4 = 12. Also by writing these prn<lucts in the reverse order 
we see that 4 X 4 = 4 x £ = 12, 4 x £ 4 ' 3 — 12. So there 
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are essentially only ten distinct products to be learnt in the Colson 
multiplication table, namely 

2X2 = 4 2x3 = 2x4 = 2X5 = 10 

3x3 = IT 3x4=12 3x5 = 15 

4x4 = 2^ 4X5 = 20 

5 X 5 = 25 

The remainder can be readily deduced from these. Note that of these 
ten products no less than six are identical with the ordinary Arabic 
forms: only the four shown in bold type are new. Accordingly the 
complete multiplication table (excluding the obvious products by i, o, 
and t) can be set out as follows: 


Table 22.2—Colson multiplication table 



S 

4 

£ 

z 

2 

3 

4 

5 

s 

25 

20 

15 

10 

10 

iS 

20 

zS 

4 

20 

24 

12 

12 

12 

T2 

24 

20 

e 

IS 

12 

n 

i4 

14 

Ti 

12 

iS 

2 

10 

12 

i4 

4 

4 

T4 

12 

10 

2 

10 

12 

14 

4 

4 

i4 

12 

10 

3 


12 

U 

14 

i4 

II 

12 

IS 

4 

20 

24 

12 

12 

12 

12 

24 

20 

5 

zS 

20 

\S 

10 

10 

IS 

20 

25 


EXAMPLE 

(1) Multiply 522 by iz. 

5 2 2 
I z 


I o 4 f 
5 2 2 

4 t 4 4 

ExplaJiation 

We first multiply 522 by 2, the right-hand figure of the multiplier, 
as in the usual method for multiplication. Thus 2 X 2 = 4 , 2 X 2 = 4, 
^ X 2 = 10, write o, carry | to the next place. We continue with a 
multiplication (of the original number 522) by the next figure i of the 
multiplier; 522 X i = 522, written one place to the left. An addition 
gives the product 4144. 
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PROBLEMS 

(i) Find 22:2 X II, III X zz, zz-, X 52, 241 X 513. 

22.7 Division 

In Colson notation there will be a slight alteration in the principle 
of division. In Arabic numerals when we divide x, say, by y, we ask 
*‘How many times does x go into y ?" or equivalently, “How many times 
can we subtract x from y”. But in Colson form subtraction of .v is re¬ 
placed by addition of (—.v), and therefore the question will be rephrased 
as “How many times must we add (—.v) to y to give as small a total as 
possible?” (i.e. as small as possible in absolute magtiitude, irrespective 
of sign). 

There are various ways of answering this, but the following method 
of successive adjustments seems both rapid and simple. Suppose we 
wish to divide y = 122021 by x = 124. The first step is to reverse the 
sign of X, giving (—a:) — \zf. We next form a table of multiples of 
(— x), as follows (see Fig. 22.1): 2 X (—-v) = 2 X izi’ = zSz; 3 x (“-'*) 
= I X (_ x) 4 - 2 X (_a:) = f’32 (by addition of the two numbers 


1 X (—x) = \zf {y 

2 X (—x) = zSz 

3 X (-x) = f^z 

5 X = 1420 



50* 

+■32 


131 

I zb 


Remainder = 15 
Fig. 22.1—Colson division 

already written); 5 X (—-v) 2 X (—-v) > 3 X ( —v) = 14^0 (by 

addition of the two numbers immediately above it in r jk- ^2 . i). n 
right of these multiples of (—-v) wc write the number y 122021 to be 
divided. We now take the first tliree figures oi y. 122. and cotisider what 
multiple of (-.v) must be added in order to cancel them out as nearly 
as possible. Clearly the correct multiple is i X ( 

added on, giving the total z, and i is written above the line as the first 
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figure of the quotient. The next step is to bring down the next figure o, 
of V, thus turning the total into zo. We consider what multiple of (—:«) 
must be added to this to cancel it out. Clearly there is no multiple which 
will do this, for zo is so small that the addition of any multiple of 
will increase it in absolute value. We therefore write o as the next 
figure of the quotient, and bring down the next figure, 2, of to give 
us Z02. Since this is negative it will be necessary to add on a negative 
multiple of (—x) to cancel it; and the appropriate one is z X (—x) = 
252, w^hich is simply the reverse of 2 X (—x) = ZS2.. This is added on, 
giving the total 50, and z is written as the third figure of the quotient. 
Again the next figure of y is brought down, to give 501, and to this we 
add 3 X (—.v) and enter 3 as the last figure of the quotient. However 
the total we obtain is 13X, and this is not as small as we can make it, for 
an addition of i X (—x) will reduce it still further. So we add a further 
I to this last figure of the quotient, giving the final value 1023 + i 
= loz^, and we also add i X (—.v) = izf to the total 13X, giving the 

final remainder iS. . 

The essential point is that at each stage of the calculation a sufficient 

multiple of (—.v) must be added to bring the total down to something 
not exceeding -5 X (—*v) in absolute value. It is easily seen vhen 
has been done, for on the left of the division sum we already have the 

value of 5 X (—-v) = X42o, and hence by (mentally) shifting the decimal 

point we see that -5 X (—.v) = l^z-o. Thus if the first choice of the 
multixde was the wrong one, and does not reduce the total sufficiently, 
it can be easily corrected by adding on a further multiple. It is only 
after this correction has been performed that we proceed to bring down 

the next figure of v. , • r j 

This method of division is applicable whatever the signs of .v andy 

may be. whether both positive, both negative, or mixed. The procedure 
is exactly the same in all cases. However it has one feature which is at 
first sight a little unusual—it may give a negative remainder. But a 
little reflection will show that this is quite reasonable, and indeed often 
preferable to the usual positive remainder. When 44 is divided by 15 
in the usual way the quotient is 2 and the remainder 14; in Colson form 
it will have quotient 3 and remainder x = —t. s»rice 3 X 15 — 45 - 
Now 44''i5 ^ 2-9333 ... (or 3-1333 - . -), and this is much nearer to 3 
than it is to 2. So the Colson method gives the more accurate answer. 
(On the rare occasions on which it is essential to have a positive re¬ 
mainder this can be readily achieved by suitable adjustment in the last 
stage of the division.) 


PROBLEMS 

(1) Find looi/ii, 341/11, 1001/13. 

(2) Divide 51^50 by 131, finding quotient and remainder. 
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22.8 Decimals 

Decimal fractions obey the same rules in Colson notation as in 
ordinary Arabic numerals. Thus since -h 24 + 123 H- 3^" = 140 
(Section 22.2) we have iT + 2-4 4 - 12-3 -r 3 +’ = 14 0 and 4 - *24 
4 -1-23 4 --3^ = 1-40. Andsince522X 12 = 4it+(Section 22.6) we have 
5*22 X 1*2 = 44hf’. The decimal point in this last multiplication must 
be placed 24-1 = 3 places from the right, according to the usual rule. 
In particular the use of decimals enables us to cliange a division by 2 
into a multiplication I>v ^ ^ ' 5 ' 2nd in the same way divisions by 4 > 5 ' 
12, and 25 can be replaced by multiplications by -25, -2, *125 and -04 
respectively. 

However there are further simplifications. It \Ne wish to write 
TT = 3-14159265 ... to 4 places of decimals (in Arabic numerals) the 
last figure 5 must be changed to a 6, because it is clear that tt is nearer 
to 3-1416 than to 3.1415. Similarly to 3 places tt wfill be 3*142. not 
3'i4i. In Colson form there is no need to adjust the final figure in this, 
way. IT = 3'i42tl3't’5 . . . ; to 4 places it is 3 ;* 4 -‘^- simple 

omission of the remaining figures, and to 3 places it is 3142. In the 
same way e — 3-f223222325 . . . becomes 31223 to 4 places (>1 
decimals, and ytzz to 3 places. (Sirictlv speaking there are a tew- 
exceptions to this rule. Thus -1253 to 2 places is really -13, not -12. 
since it is just a little nearer to -13 than it is to 12. But the error coin- 
mitted in ignoring these exceptions is very small, and in the great 

majority of calculations can be safelv neglected.) 

Many people are a little puzxled at first by die statement that the 
recurring decimal *9999 ... is etiual to 1. 1 liey feel that m sonic \aguc 

way this runs against common-sense. But what is meant is o cmirse 
that the sequence of decimals -9. -99. •999- • • ■ approaches 1 as a 
In Colson notation the apparent p.iradox disaiipe.irs. I (ir t le scijuence 
becomes i-], 101, i-ooj, l oooi, . . . wliich cle.irly tends to 1. 


FRO HI. I-MS 


ress ./II, ./13 a.ul ./ 4 £ recrriiiK ileciMK.Is in Cnison 

/hat do you notice about the hrst three ot these ilecitn.ils. 


(1) Express 

notation. ^ 

(2) Express 2/i£, 3/1 1 ', i/if, e/if. recurring decimals. Com¬ 

pare these with i/if; wlial do you notice.^ 


22.9 Interchangeability of five and quin 

Colson notation, as we have presented it. has one disadvantage, it 
has eleven digits, as compared willi only ten in Ar.ibic notation. n ac 
it has one more than we really need: the dip'll qmn , - , is supci uous. 
Thus 2S means 20 - 5, i.e. 15, 4S - 35* ' it sell ean le written 15, i.e. 

— 10 -p 5. Wherever S occurs wc can if we wish replace it by 5 - Pro¬ 
vided that the next digit on the left is decreased by i. Convcisely wc 
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can replace any 5 by S, by increasing the next digit on the left by i, e.g. 

25 = 35> ^5 = 

But although S is not strictly an essential part of the Colson system, 
it is a very useful digit, and to do without it entirely would d^troy the 
sjrmmetry of the notation. However in the formal presentation of the 
final results of calculations it seems desirable to have some definite 
convention as to when 5 is to be used, and when S. For without such a 
convention there would be several ways of writing certain numbers, 
and that might lead to inconsistency and confusion. The simplest rule 
is to use 5 for positive numbers, and S for negative ones, thus preserving 
symmetry; —45 = —525 = SzS. (But other conventions are 

possible, e.g. to use 5 or J in such a way as to make the next figure on 
the left always even. Other rules may suggest themselves to the reader.) 

22.10 Miscellaneous operations 

All arithmetical rules can be translated into Colson notation. Many 
rules will require no translation at all: this is true for example of the 
tests for divisibility of a number by 2, 3, 9 and ii, discussed in Section 
2.2. They apply equally well in Arabic or in Colson notation. We c^ 
also state the rule for divisibility by 5 in a similar universal form: “a 
number is exactly divisible by 5 if its last digit is divisible by 5* 
divisible by 5- = 25 if the last two digits are divisible by 25, and by 
^3 = 125 if the last three digits are divisible by 125”. 

Where arithmetical operations are substantially changed by trans¬ 
lation into Colson form they are almost always simplified. Thus con¬ 
sider a table of cosines: in Colson form this will give, say, cos 23*1° — 
i‘l20Z, with mean differences for the next figure as follows 

second decimal 12345 

difference i j z £ £ 

We notice at once that these differences are negative: there is no 
need to print the injunction “subtract differences*’ at the head of the 
table. We also see that there are only five printed differences for final 
figures I to 5. I'hose for final figures j to S can be immediately deduced 
by a change of sign. Thus the difference for 3 is given as z, so that 
cos 23 13° = I'lzoz + z = I'lzof. It follows that the difference for 
£ must be 2, and cos 231^°= i-\2oz + 2= iqaoo. In Colson 
notation there will be no differences for 6, 7, 8, or 9; and that omission 
has two important consequences. Since the differences for i, 2, 3, 4 » 5 
are smaller than those for 6, 7, S, and 9, the arithmetic will be lightened. 
Also the smaller mean differences are less liable to error than the larger 
ones, and their use will make the table more reliable. (For that reason 
many tables in ordinary Arabic notation have mean differences only up 
to 5, thus implicitly using the principles of Colson notation for the last 
digit.) 
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In ordinary calculations it is usual to write log 35 as 1-5441, and log 
•035 as 2-5441. These numbers which are partly negative and partly 
positive are a bit of a nuisance, and require special treatment. But in 
Colson notation they are written X'544i and 2-5441 (since i is the same 
as T, and z = 2). So written they are no longer in any way exceptional, 
and call for no special comment. In the same way because log 2-1 
= -3222 we see that log 21 = 1-3222, log -21 = 1-3222, log -021 = 
2*3222; and from log 5 = i-foio it follows that log 50 = 2-€oio, log -5 
= -Coto, and so on. Note that the usual rule for finding the integer 
part (or “characteristic”) of a logarithm by subtracting i from the num¬ 
ber of figures to the left of the decimal point cannot always be applied 
in Colson notation; but it is always quite easy to find the correct logar¬ 
ithm in the manner indicated above. 

The reader may like to investigate for himself the application of 
Colson notation to other processes. It is specially useful in interpol¬ 
ation (Section 3.8), the solution of simultaneous equations (Section 
17.4), and in matrix calculations (Chapter 18). 

PROBLEMS 

(1) Show that a perfect square cannot end in 2, 2, 3. C, when written 
in Colson notation. 

(2) Show that the cube of any whole number w either ends in the 
same figure as ti, or with that figure inverted. 

(3) Show that all prime numbers other than 2 and 5 end with 1, l, 
3, or e. 

(4) Arrange the integers from t to 4 in a “magic square” of three 
rows and three columns in such a way that the numbers in each row, 
in each column, and in each of the two diagonals all add to the same 
total. 

(5) Write the odd numbers from [S to 15 in order in a 4 x 4 square, 
thus 


iS 

le 

n 

n 

13 

s 

£ 

I 

1 

3 

5 

I £ 

* [ 

11 

*3 

15 


By changing the signs of certain of these numbers convert tliis into a 
magic square. 

22.11 Epilogue: what is mathematics? 

In this book wc have been mainly concerned with looking at biology 
from a mathematical standptjint. Perhaps it may be worth while to 
reverse the procedure, and spare a few lines to look at mathematics from 
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a biological point of view. For mathematics is an activity of a living 
creature, man. In fact it is a special kind of language. 

But what is language? This is indeed a question with no simple 
answer. Life and living things are flexible, adaptable, with many 
activities, hardly to be reduced to any simple formula; and language, a 
product of life, is itself many-sided. It is a means of communication, 
a vehicle for emotion, a device for thinking, a game. . . , Even words 
themselves, the units of language, are surprisingly difficult to define. 
Consider merely such a concrete, everyday word as “dog”. How has 
this come to cover both small and large animals, hairy and relatively 
naked, long and short, white, black and bro\vn, tame and wild? How 
do we distinguish between a “dog” and a “cat”? Mathematics, too, 
which at first glance seems rigid and mechanical, shows many signs of 
its living origin. Its adaptability is made evident by its applications to 
so diverse subjects as medicine, genetics, evolution, physics, chemistry, 
astronomy, and economics. The sign + stands for “addition”; but 
“addition” means something a little different when applied to whole 
numbers, fractions, complex numbers, vectors, matrices, probabilities, 
etc.; all these we manage to cover quite comfortably b)" the single word 
“plus”. 

Perhaps the best way of looking at language is to consider it prim¬ 
arily as a special kind of tool by w'hich we grasp the world around us, and 
so better mould it to our own desires. Such a tool greatly enlarges our 
capabilities, just as ordinarj" mechanical tools enable us to do much that 
would be impossible with hands and feet alone. But even the best 
instruments need human guidance, and even the best are inferior to the 
human body in adaptability. 

Mathematics also needs to be wisely guided. Like other instruments 
it can be used properly and with discrimination, or foolishly and in¬ 
appropriately. A steam hammer is not used to crack a nut, let alone to 
mend a watch; neither should mathematical methods be employed 
merely mechanically. A good workman is also skilful with his tools, 
knowing the most effective methods of work. But even the finest and 
most effective instrument can be put to base as well as to noble purposes, 
to impoverishment and destruction as well as to the enlargement of life 
and the creation of beauty. It is the purpose of this book to help the 
reader to appreciate the power of mathematics, to understand its tech¬ 
niques, and to encourage him to use them skilfully, intelligently and 
nobly. 
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EXPLANATION OF THE TABLES 


Table i—The Greek alphabet 

This table is included for reference. It gives a list of the letters of the 
Greek alphabet, and their names (in the customar>' English spelling). (Note 
that the ch in chi, x> ‘S always pronounced hard, i.e. like k.) 


Table a—Common logarithms 

This is a four-figure table of common logarithms, with mean differences. 

iVIost tables of logarithms do not give four-figure accuracy in the early part 
of the table if mean differences are used. In order to remedy this defect the 
values of log x are here tabulated for each increase of "005 in x when x lies 
betw’een 1*0 and f5, instead of the usual tabulation at inter\als of "Oi. Thus 
the table gives log I'ooo = *0000, log I'oos = '0022, log I'oio = *0043, log 
1-015 “ *0065, and so on. Logically these should be written in order in a 
row; but in order to save space and to present the tabulation in a convenient 
form the value of log 1*005 (*= 0022) ‘S printed immediately below that of 
log 1*000 (=*0000). The value of log 1*015 (=*0065) is printed immediately 
below that of log 1*010 (=* *0043), and so on up to log i *495 (~ •i 74 ^>) ^'hich 
is below log 1*490 (= *1732). After this point the tabulation proceeds in the 
usual way, since the extra entries are no longer needed to ensure sunicient 
accuracy. Thus in the row opposite “t* 5 ” in the left-hand margin occur the 
values of log 1*50 - *1761, log 1*51 = *17^®* 1*52 — *1810, and so on. 

This latter part of the table is to be used in the custontan.* way. 

Example. To find log 2*544. 

Opposite “2*5'’ in column •*4” find log 2*54 = -4048 
Mean difference for last figure “4" — 7 


log 2*544 = *4055 

The use of the first part of the table is almost as simple, once the principle 
of tabulation has been undcrsloud. 'rhus to find h>K **030 wc look ulon^ the 
row marked •'l o” as fur as column headed "i". finding log 1*030 « 0128. 
Underneath this will be found log 1*035 *= omu- In this part of the table 
mean, differences will only be required up to 4 in the last place, since, tor 
example, “1*037” can be looked on as ‘‘1*035 + 2 m the last place , and its 
logarithm can be found thus-- 


log 1*035 
diff. for 2 


•0149 

8 


log 1*037 


*0157 


Further accuracy is gained by taking advantage of the available space 
giving two sets of mean differences opposite “l o '. I he upper set is ina 


and 

giving two sets of mean dillerences opposite ”i*o ’. I he upper set is marked 
^•(1*00-4)”. meaning that it is to be used when the first three figures of the 
number lie between 1*00 and 1*04 inclusive; the second set is marked (1*05 
9)”, and is to be used when the first three figures he between 1 *05 and 1 *09 
inclusive. 
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Example. To find log 1-034. 

Opposite “i-o” in column “3” find log i '03 = -0128 

Difference for “4”, in upper row, = _^ 

log 1*034 = -0145 


This standard of accuracy will be found sufficient for most purposes. But, 
if so desired, it can be still further improved in some cases by the use of 
negative differences (Colson fashion) where the number approaches the next 

tabulated value. 


Example. 
1-910. Thus 


To find log 1-919. We note that 1*919 is nearer 1*920 than 

log 1-920 = *2833 

diff. for —1 = —2 


Example. 
1-220. Thus 


log 1*919 “ *2831 

To find log 1*224. We note that 1*224 is nearer 1*225 than 

♦ 

log 1*225 ~ -0881 

diff. for — I = —4 


log 1*224 = *0877 

Antilogarithms can be found by the inverse use of the table. 

Example. To find antilog-7813. 

IJv inspection, the nearest entry in the table is log 6*04 = *7810. This falls 
short of the required value -7813 by 3 - The entrs* “3” in the mean difference 
table occurs in the column headed “4”; therefore log 6*044 = '78io + 3 0 ^ 
the last figure) = -7813. or antilog *7813 « 6*044. 

Natural logarithms can be found by the formula 

In .V = 2 3026 log .V 

and exponentials by the formula 

e-t = antilog (*4343.v) 


Table 3 Natural sines 

This table gives the value of sin -v'’, where .v is expressed in degrees and 

decimals of a degree. 

Cosines can be found by the formula 

cos .v® — sin (90 — •x)'’ 


An inverse use of this table gives the value of sin'^x, expressed in degrees; 
and cos”‘.v = 90“ — sin^'.v. 

Also sin ( —-v)'* = —sin .v® 
sin-*(—•'■)= —sin-*.v 

This decimal tabulation is preferable to the usual tabulation in degrees and 
minutes, both on the general ground of its greater simplicity, and also in the 
evaluation of integrals such as 



const. + sin”^ .x (in radians) 
const. + *01745 sin"* x (in degrees). 


However the equalities o*i“ = 6 minutes, 0*05“ = 3 minutes enable it to 

be used with the division into minutes. 

I radian = 57 * 3 ®* 

1° = *01745 radian. 
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ExantpU. To find sin 69-52®. 

From table, sin 69*5® == ‘9367 

difF. for 2 = I 

sin 69-52® = ‘9368 

Example. To find cos 20-48°. 

cos 20-48° = sin (90 — 20-48)® 

= sin 69-52® 

— -9368 (see preceding example). 


•O-Q 


dx 


( I - X*) 


Example. To find f 

J O 

This is -01745 (sin“‘ 0-9 — sin“‘ o). But from tables: 

sin 64-1® = -8996 

mean diff. for 5 = 4 


sin 64-15° = -gooo 

Thus sin-‘ 0-9 = 64-15°, sin"' 0=0, and therefore the integral is 01745 
64-15 = II19. 


the n^ain 
first 
for 


Table 4—Normal or gaussian integral 
This table gives the values of 

P(A') * (2Tr)-i f e-ii*di 

to four places of decimals. In order to ensure four-figure accuracy the 1 
part of the table has been given at intervals of -005 in A, exactly like the 
part of the table of common logarithms (see the explanation of Table 2 f< 
fuller details). Thus opposite "i z” in the left-hand margin, in column •4 
will be found P(i-24) = -8925. and underneath this is printed /Mi^ 245 ) 
-8934. In the part of the table in which this interval of tabulation is used 
mean differences are only needed up to 4 in the last figure, .\dvuntage is t.iken 
of the space available to give more accurate values of mean diflercnces; thus 
opposite “i-2” will be found the mean differences ••2, 4. 6. S" marked 
“(1-20-4)”, that is. applicable when the first three figures of A lie between 
1-20 and 1-24 inclusive, and also the mean differences 2, -j, 5, 7 (1 '25—9) i c. 
applicable when the first three figures of X lie between 1-25 and 1-29 inclusive. 


Example. 

To find P (1-243) 

From table, /®(1-240) = 

diff. for 3 

•8925 

0 


/>(i- 243 ) = 

•893 > 

Example. 

To find P (1-248) 

From table, P (1-245) = 

diff. for 3 

•8934 

h 


P(|- 248 ) 

•0940 


Table 5—Significance points of the sample correlation coenicient r 

This table gives, for sample number n not exceeding 102 the v^alues whief 
r must exceed in absolute magnitude in order to be judged significant at the 
•05 and -oi levels. Thus when there are n =■ pairs of obser%ations a value 
of r exceeding -30 in absolute magnitude (such as r — -35 or ' 35 ) be 
judged significant at the 05 level; and if |r| > - 39 . then r is significant at the 
01 level. A value of r which is not significant at the 05 level can be reasonably 
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explained as occurring by chance even when the v^abl^ concerned we in 
reality independent or uncorrelated. A value which is significant at the -05 
level gives a reasonable presumption of the existence of correlauon, and one 

significant at the 01 level is very good evidence for coxT^ation. 

For values of n greater than 100 the obsen'ed coefficient will U judged 
significant at the -05 level if it exceeds and at the -oi level if it ex¬ 
ceeds 2*‘;8/v''n (in absolute value). _ , .. 

The number m — a is often called the “number of degrees of free¬ 
dom” of T, and is also given in the table. It is useful in applications, such as 
partial correlations, which are beyond the scope of this book. 


Table 6—Significance points of x‘ (chi-squared) 

This table gives the significance points for x* et the -05 and -oi levels up 
to 30 “degrees of freedom” v (see Section 21.4). A value of x* which exceeds 
the appropriate point is judged significant; e.g. for v = 24, a value 01 x — 
40-2 will be significant at the *05 level but not at the *01 level. . , , 

When there are more than 30 degrees of freedom v, x* w"i be judged 
significant at the -05 level if Vx* — V(»’ — i) > *‘ 39 . and at the -oi level if 

- \/(^ - i) > *‘83. 


Table 7—Significance points for the variance ratio F and for 
“Student’s” t 

This table is used in the “Anab'sis of Variance” (see Section 21.5). If 
nisq. and rnsq^ are two “mean squares” with vj and v, “degi^es of freedom 
respectively, then F is defined to be the ratio msqjmsqt. The two parts ot 
this table give the significance points of Fat the 'OS and 'Oi levels respective y. 
A value of F exceeding the appropriate point shows that the first mean square, 
msqt. is significantly greater than the second m^n square, nisq^; or in the 
application discussed in Section 21.5, that there is probably a true difference 

between the means in question. . ^ r .*e_ j . 

In the last column there is given the significance point for Student s t. 
Thin can be considered as the square root of F when - i: it is used in 
testing for a real difference between nvo means. A value of t exceeding the 

given point in absolute value is judged significant. •. 1 

Note that the higher values of »•» and are chosen in such a way that 24/^1 
and 120,'. 3 are whole numbers. This enables intermediate values to be found. 
Suppose for example it was necessary to find the -05 point for F when - i. 

48. Now in the column vj = 1 we find the values F= 408 when 
., = 40. i.e. lao/i-s = 3. and F = 4‘O0 when i-, = 60, i.e. 120/r* - 2. But 
120/48 =* 2-^. and thus, considered in this way, comes half-way bervveen the 
values . 5 = 40 and »•, = 60. The corresponding F will be half-way between 

4 08 and 4-00. i.e. will be 4-04. j t . x. _ 

If both .'1 and are large the following formula must be used. Let ft — 

• 1' 2/(« 1 + ‘ O- 'The" ^ judged significant at the -05 level if 


log F 


i-oi 


\^k - 0*5 


and at the -oi level if 


i '43 

log F > 

\/ k — 0 7 




Table 8—Conversion from r to 2 = tanh"‘ r and conversely 

This table gives the amount, 2 — r, to be added to r to give z, or to be sub¬ 
tracted from z to give r. For higher values use the formulas 


tanh“‘ r = 1*151 log 


I •+• r 


I — r 

— 2/(1 -b antilog *86862) 


r = tanh 2 


= 1 
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Example. To find s when r = -47. From the table we see that the amount 

to be added to r is 04, s = *47 + -04 = .'S*- . 

If Ti and Tj are two correlation coefficients obtained from samples 01 num¬ 
bers n, and respectively, it is possible to test whether they are significantly 
different (i.e. whether the true correlations are different) in the following way. 
which is especially useful when fti and ng are large. 

Calculate i? = (r^ - rB)/(i - r*) and K= [-^L3 + 0,-3] 

Then the difference is significant at the -05 level if Itanh;' > 1-96 AT. and 
at the -oi level if ( tanh"* R \ > 2-58 K. If I i? I < -2. it is good enough to 

take tanh”‘ i? = i?- 




Table i—Greek alphabet 



A 

a 

alpha 

N 

V 

lUi 

B 


beta 



xi 

r 

y 

gamma 

0 

0 

omicron 

A 

s 

delta 

11 

7 T 

pi 

E 

€ 

epsilon 

P 

P 

rh«> 

Z 

C 

zeta 


a 

sigma 

H 

V 

eta 

r 

T 

tan 

0 

e 

theta 

1* 

V 

iipsilon 

I 

i 

iota 

<i) 

•f* 

phi 

K 

K 

kappa 

X 

X 

chi 

A 

X 

lambda 

M” 

0 

psi 

M 


inu 

U 

O} 

omega 
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Table 2—Common logarithms 



0 12 3 

4 5 6 

7 8 9 

t 2 

3 

4 






1*0 

*0000 *0043**0086 *0126 

•0170 *0212 *0253 

•0294 *0334 *0374 

4 8 

13 

17 


P 

•00-4) 



•0022 *0065 *0107 *0149 

•0191 *0233 *0273 

•0314 *0354 *0394 

4 8 

12 

16 


Cl 

•05- 

9) 


1*1 

«0414 *0453 *0492 *0531 

•0569 -0607 *0645 

•0682 *0719 •0755 

4 8 

12 

16 


(1 

•10-2) 



•0434 *0473 -OSII -0550 

•0588 *0626 *0663 

•0700 *0737 *0774 

4 8 

11 

15 


(t 

•13- 

9) 


1*2 

•0792 •0628 *0064 *0899 

•0934 '0969 *1004 

•1030 *1072 *1106 

4 7 

11 

14 


(1 

*20- 

3) 



•0810 *0846 *0881 *0917 

*0952 *0986 *1021 

•1055 •1089 •1123 

3 7 

10 

14 


(1 

•24-9) 


1*3 

•1139 *1173 *1206 *1239 

•1271 *1303 *1335 

•1367 *1399 *1430 

3 7 

10 

13 


(1 

•30- 

3) 



*1156 *1109 *1222 *1255 

*1267 *1319 *1351 

*1383 *1414 *1446 

3 6 

10 

13 


h 

•34-9) 


1*4 

•1451 *1492 *1523 *1553 

*1584 *1614 -1644 

•1673 *1703 *1732 

3 6 

9 

12 







•1477 *1508 *1530 *1569 

*1599 *1029 *1658 

•1688 *1717 *1746 

1 2 

3 

4 

5 

6 

7 

8 

9 

VS 

•1761 *1790 *1818 *1647 

•1875 *1903 *1931 

•1959 *1987 *2014 

3 6 

8 

11 

14 

17 

20 

22 25 

1*6 

•2041 *2068 *2095 *2122 

♦2148 *2175 *2201 

•2227 *2253 *2279 

3 5 

8 

It 

13 

16 

18 

21 

24 

1*7 

•2304 *2330 *2355 *2380 

♦2405 *2430 *2455 

•2480 *2504 *2529 

2 5 

7 

10 

12 

15 

17 

20 22 

1*8 

•2553 *2577 *2601 *2625 

•2648 *2672 *2695 

*2718 *2742 *2765 

2 5 

7 

9 

12 

14 

16 

19 21 

1*9 

*2768 *2810 *2833 *2856 

•2878 *2900 *2923 

♦2945 *2967 *2989 

2 4 

7 

9 

11 

13 

16 

10 

20 

2*0 

•3010 *3052 ♦3054 *3075 

*3096 *3118 *3139 

•3160 •3181 *3201 

2 4 

6 

a 

11 

13 

15 

17 

19 

2*1 

*3222 *3243 *3263 *3264 

•3304 *3324 *3345 

*3365 '3385 *3404 

2 4 

6 

8 

10 

12 

14 

16 

18 

2*2 

•3424 *3444 *3464 *3483 

•3502 *3522 *3541 

*3560 *3579 *3598 

2 4 

6 

8 

10 

12 

14 

15 

17 

2*3 

*3617 *3636 *3655 *3674 

*3692 *3711 *3729 

•3747 *3766 *3784 

2 4 

6 

7 

9 

11 

13 

15 

17 

2*4 

*3802 *3820 *3838 *3856 

♦3874 *3892 *3909 

♦3927 *3945 *3962 

2 4 

5 

7 

9 

tt 

12 

14 

16 

2*3 

*3979 *3997 *4014 *4031 

♦4048 *4065 *4082 

*4099 *4116 *4133 

2 3 

$ 

7 

9 

10 

12 

14 

15 

2*6 

•4150 *4166 *4183 *4200 

*4216 *4232 *4249 

♦4265 *4261 *4298 

2 3 

S 

7 

8 

10 

tt 

13 

15 

2*7 

•4314 *4330 *4346 *4362 

•4378 *4393 ♦4-109 

•4425 *4440 *4456 

2 3 

s 

6 

8 

9 

It 

13 

14 

2*8 

•4472 *4487 *4502 *4518 

♦4533 *4546 *4564 

•4579 *4594 *4609 

2 3 

s 

6 

8 

9 

11 

12 

14 

2*9 

•4624 ^4639 *4654 *4669 

*4683 *4698 *4713 

*4728 '4742 *4757 

1 3 

4 

6 

7 

9 

to 

12 

13 

3*0 

*4771 *4706 *4800 *4814 

•4029 *4843 *4857 

♦4871 *4886 *4900 

1 3 

4 

6 

7 

9 

to 

11 

13 

3*1 

•49M *4928 *4942 *4955 

*4969 -4983 *4997 

*5011 *5024 •5038 

1 3 

4 

6 

7 

8 

to 

11 

12 

3*2 

•5051 *5065 *5079 *5092 

•5105 *5119 *5132 

•5145 *5159 *5172 

1 3 

4 

5 

7 

8 

9 

11 

12 

3*3 

•5185 *5190 *5211 *5224 

•5237 *5250 *5263 

•5276 *5289 *5302 

1 3 

4 

5 

6 

8 

9 

10 

12 

3*4 

•5315 *5328 *5340 *5353 

•5366 *5370 *5391 

•5403 *5416 *5428 

1 3 

4 

5 

6 

8 

9 

10 

11 

3*5 

•5441 *5453 *5465 *5478 

*5490 *5502 *5514 

♦5527 *5539 *5551 

1 2 

4 

5 

6 

7 

9 

to 

11 

3*6 

•5563 *5575 *5587 *5599 

•5611 *5623 *5635 

•5647 *5658 *5670 

1 2 

4 

5 

6 

7 

8 

10 

11 

3*7 

•5662 *5694 *5705 *5717 

*5729 *5740 *5752 

•5763 *5775 *5786 

1 2 

3 

5 

6 

7 

8 

9 

10 

3*0 

•5798 *5809 *5621 *5832 

*5843 *5855 *5066 

•5877 *5888 *5899 

I 2 

3 

S 

6 

7 

8 

9 

to 

3*9 

•5911 *5922 *5933 *5944 

*5955 *5966 *5977 

*5988 *5999 *6010 

1 2 

3 

4 

S 

7 

8 

9 

to 

4*0 

♦6021 *6031 *6042 *6053 

•6064 *6075 *6085 

•6096 *6107 *6117 

t 2 

3 

4 

s 

6 

8 

9 

to 

4*1 

•6128 *6138 *6149 *6160 

•6170 •6180 *6191 

*6201 *6212 *6222 

t 2 

3 

4 

s 

6 

7 

8 

9 

4*2 

*6232 *6243 *6253 *6263 

'6274 *6284 *6294 

*6304 *6314 *6325 

1 2 

3 

4 

s 

6 

7 

6 

9 

4*3 

•6335 *6345 *6355 *6365 

•6375 *6385 *6395 

*6405 *6415 '6425 

1 2 

3 

4 

s 

6 

7 

8 

9 

4*1 

•6435 *6444 *6454 *6464 

•6474 *6484 *6493 

*6503 *6513 *6522 

1 2 

3 

4 

s 

6 

7 

6 

9 

4*5 

•6532 *6542 *6551 *6561 

*6571 '6580 *6590 

*6599 *6609 *6618 

1 2 

3 

4 

s 

6 

7 

8 

9 

4*6 

•6628 *6637 *6646 *6656 

*6665 *6675 *6684 

•6693 *6702 *6712 

1 2 

3 

4 

s 

6 

7 

7 

6 

4*7 

•6721 *6730 '6739 *6749 

•6758 *6767 *6776 

•6785 *6794 *6803 

1 2 

3 

4 

5 

5 

6 

7 

8 

4*P 

'6012 *6821 -6830 *6839 

*6848 *6857 '6866 

*6875 *6084 *6893 

1 2 

3 

4 

4 

5 

6 

7 

8 

4*9 

'6902 *6911 *6920 *6928 

•6937 *6946 *6955 

•6964 *6972 *6981 

1 2 

3 

4 

4 

5 

6 

7 

8 
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Table a—Common logarithms— 



0 12 3 

A S 6 

7 0 9 

5*0 

3M 

5*2 

5*3 

5M 

•6990 *6998 *7007 ’7016 
•7076 *7084 ’7093 *7101 
•7160 •7168 *7177 •718S 
•7243 *7261 •7259’ *7267 
•7324-•7332 *7340 *7340 

•7024 *7033 *7042 
•7110 •7110 *7126 
•7193 *7202 *7210 
•7275 *7204 *7292 
•7336 •7364 •7372 

•7050 *7059 -7067 
•7135 *7143 -7152 
•7210 *7226 -7235 
•73CO *7308 -7316 
•7300 *7380 -7396 

3*5 

5*6 

S^7 

S^S 

S ^9 

•7404 *7412 ‘7419 -7427 
•7482 ‘7490 *7497 -7305 
•7559 *7566 '7574 *7582 
•7634 *7642 *7649 *7657 
•7709 *7716 *7723 *7731 

•7435 *7443 *7451 
•7513 '7520 -7520 
•7509 *7597 *7604 
•7664 *7672 -7679 
•7738 *7745 •7752 

•7459 *7466 *7474 
-7536 *7543 *7551 
•7612 -7619 *7627 
•7606 *7694 *7701 
•7760 -7767 *7774 

6*0 

6M 

6*2 

6*3 

6*4 

•7782 *7789 *7796 ‘7803 
•7853 *7860 '7868 ‘7875 
•7924 *7931 ‘7938 *7945 
•7993 *8000 *8007 'eOM 
•8062 ‘8069 *8075 *0082 

•7010 -7010 *7025 
•7002 *7809 *7096 
•7952 -7959 *7966 
•0021 '8020 *0035 
•8009 *6096 ^8102 

•7032 *7839 *7046 
•7903 -7910 *7917 
•7973 -7960 •7987 
•6041 •6048 *8055 
•6109 -0116 *0122 

6*5 

6*6 

6*7 

6*8 

6*9 

•0129 *8136 ■8142 '8149 
•8195 •0202 '8209 *8215 
•6261 *8267 '8274 •8280 
•8325 *8331 *8338 '8344 
•6388 '8395 '8401 *8407 

•0156 -0162 *0169 
•8222 -0220 *0235 
•8287 *6293 *0299 
•0351 *6357 *6363 
•0414 *6420 •6426 

•0176 *0162 *8169 
•0241 *0240 *8254 
*8506 *0312 *0319 
•0370 -8376 •8362 
•0432 *0439 *0445 

7*0 

7*1 

7*2 

7*3 

7*4 

•8451 *8457 '6463 *8470 
•8513 *8519 ‘8525 '6531 
•8573 ‘8579 '8585 '8591 
•0633 '8639 *8645 *8651 
•8692 '8696 *8704 '8710 

•0476 •0402 'SiSS 
•6537 •0543 *0549 
•8597 *6603 *6609 
•0657 •0663 -8669 
•0716 *0722 *0727 

•8494 *6500 •0506 
•0555 *8561 •0567 
•6615 -8621 -0627 
•8675 *8661 •0666 
•0733 -8739 *0745 

7*5 

7*6 

7.7 

7*8 

7*9 

•8751 •8756 *8762 *6768 
•8806 *8014 '6820 '6625 
•8865 *8671 '8676 *0662 
•8921 '8927 '8932 '8938 
•8976 *8982 *8987 '8993 

•6774 -0779 -6705 
•6031 -0637 *0042 
•0607 •0093 •0099 
•0943 '0949 •0954 
•8990 *9004 *9009 

•0791 -0797 -0002 
•0840 -0054 -8059 
*0904 -0910 *0915 
•0960 *0965 -097! 
*9015 *9020 *9025 

6:0 

8*1 

8*2 

8*3 

6*4 

•9031 *9036 *9042 '9047 
•9085 *9090 '9096 '9101 
•9130 *9145 *9149 '9154 
•9191 *9196 *9201 '9206 
•9243 *9246 '9253 '9258 

*9053 *9056 *9063 
•9106 *9112 *9117 
•9159 *9165 -9170 
•9212 *9217 *9222 
•9263 -9269 *9274 

•9069 *9074 *9079 
•9122 •9126 -9133 
•9175 *9100 ♦9166 
•9227 *9232 *9230 
•9279 *9284 *9269 

8*5 

8*6 

8*7 

6*8 

8*9 

•9294 *9299 *9304 *9309 
•9345 *9350 *9355 '9360 
•9395 *9400 *9405 '9410 
'9445 *9450 '9455 *9460 
•9494 *9499 *9504 *9509 

•9315 *9320 -9325 
•9365 -9370 -9375 
•9415 -9420 -9425 
•9465 -9469 *9474 
•9513 *9518 *9523 

•9330 •9335 •93>0 
•9300 -9365 -9390 
•9430 -9435 *9440 
•9479 •9404 *9409 
•9520 *9533 *9530 

9*0 

9*1 

9*2 

9*3 

9*4 

•9542 *9547 *9552 *9557 
•9590 • 9 S 95 *9600 *9605 
•9638 *9643 *9647 *9652 
•9665 '9689 *9694 *9699 
♦9731 *9736 *9741 *9745 

•9562 *9566 *9571 
•9609 *9614 *9619 
•9657 *9661 *9666 
•9703 -9700 *9713 
•9750 *9754 -9759 

•9576 *95fi1 *9566 
•9624 *9626 *9633 
•9671 •9675 *9680 
•9717 -9722 *9727 
•9763 -9700 *9773 

9*5 

9*6 

9*7 

9*6 

9*9 

•9777 *9782 *9766 *9791 
•9823 ‘9827 *9832 *9636 
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•9956 *9961 '9965 '9969 

♦9795 -9600 *9005 
•9041 *9645 *9050 
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Table 3—Natural sines 


0 




6 


7 8 9 < 2 T 4 5 6 7 


0 *0000 *0017 *0035 *0052 

1 *0175 *0192 *0209 *0227 

2 ‘0349 *0366 *0384 *0401 

3 ’OSZl ‘0541 *0538 '0576 

4 *0698 -0715 *0732 *0750 


s 

*0872 *0889 *0906 

6 

•1045 *1063 *1000 

7 

*1219 *1236 *1253 

8 

•1392 *1409 *1426 

9 

•1564 *1582 *1599 

10 

•1736 -1754 '1771 

t1 

•1908 *1925 *1942 

12 

•2079 '2096 '2113 

13 

•2250 *2267 '2284 

14 

•2419 '2436 '2453 

15 

*2588 '2605 '2622 

16 

•2756 '2773 *2790 

17 

•2924 '2940 *2957 

19 

•3090 *3107 '3123 

19 

•3256 '3272 *3289 

20 

•3420 *3437 ■3453 

21 

•3584 *3600 ‘3616 

22 

'3746 '3762 -3778 

23 

•3907 *3923 *3939 

24 

•4067 ‘4003 -4099 

25 

•4226 *4242 ‘4258 

26 

•4384 ’4359 ’4415 

27 

•4540 *4555 ••1571 

29 

•4695 ‘4710 •4726 

29 

•4046 ‘4863 *4879 

30 

•5000 *5015 *5030 

31 

•5150 *5165 *5180 

32 

•5299 *5314 *5329 

33 

'5446 ‘5461 *5476 

34 

•5592 ’5606 '5621 

35 

•5736 '5750 *5764 

36 

'5878 *5892 *5906 

37 

•6018 *6032 *6046 

38 

• 6157 -eno *6184 

39 

•6293 *6307 *6320 


0924 

1097 

1271 

1444 

’1616 

•1788 

■1959 

•2130 

•2300 

'2470 

•2639 

■2807 

•2974 

•3140 

■3305 

'3469 

•3633 

•3795 

•3955 

•4115 

•4274 

•4431 

•4566 

•4741 

•4894 

•5045 

•5195 

•5344 

•5490 

•5635 

•5779 

'5920 

•6060 

•6198 

•6334 


40 

41 

42 

43 

44 


6428 '6441 
6561 *6574 
6691 •6704 
6820 ‘6833 
•6947 '6959 


6455 ‘6466 
6587 ‘6600 
6717 -6730 
6645 '6858 
6972 *6984 


•0070 *0087 *0105 
•0244 '0262 '0279 
'0419 ’0436 *0454 
•0593 '0610 '0628 
•0767 '0785 *0802 

•0941 ‘0958 -0976 
•1115 '1132 *1149 
•1288 '1305 '1323 
'1461 -1478 '1495 
•1633 '1650 '1668 

•1605 *1822 '1840 
'1977 ‘1994 *2011 
•2147 '2164 '2181 
•2317 '2334 ‘2351 
•2487 ‘2504 '2521 

'2656 '2672 '2689 
•2823 •28*10 '2857 
'2990 '3007 *3024 
■3156 '3173 ‘3190 
•3322 -3330 *3355 

•3486 '5502 '3518 
'3649 *3665 '3681 
'3011 '3827 •3843 
•3971 ‘3987 '4003 
■4131 '4147 ‘4163 

'4289 ‘4305 *4321 
•4446 '4462 '4478 
•4602 '4617 '4633 
♦4756 '4772 '4787 
•4909 ’4924 ‘4939 

•5060 '5075 -5090 
•5210 ^5225 ‘5240 
•5358 *5373 '5388 
•5505 '5519 *5534 
•5650 '5664 *5678 

•5793 '5807 '5821 
•5934 '5948 '5962 
•6074 *6088 '6101 
•6211 '6225 ‘6239 
'6347 '6361 ‘6374 

•6401 ‘6494 *6508 
•6613 ‘6626 ‘6639 
♦6743 *6756 ‘6769 
•6671 •6884 -6896 
•6997 ‘7009 ‘7022 


0122 '0140 *0157 
0297 *0314 '0332 
0471 *0488 *0506 
0645 -0663 !0680 
0819 *0837 "0854 

’0993 '1011 '1020 
•1167 *1164 '1201 
.1340 '1357 '1374 
•1513 '1530 '1547 
•1685 '1702 '1719 

•1857 '1875 '1891 
■2020 *2045 *2062 
•2198 '2215 '2233 
•2368 *2385 *2402 
'2538 '2554 -2571 

•2706 *2723 *2740 
•2074 *2090 ‘2907 
•3040 -3057 '3074 
•3206 '3223 '3239 
•3371 '3387 '3404 

'3535 *3551 '3567 
♦3697 '3714 *3730 
•3859 *3875 *3891 
•4019 *4035 *4051 
■4179 *4195 ^4210 

•4337 *4352 *4368 
•4493 ‘4509 *4524 
•4648 *4664 *4679 
*4802 *4818 '4833 
•4955 '4970 "4985 

'51CS '5120 '5135 
•5255 '5270 -5284 
'5402 '5417 '5432 
•5548 •5563 •5577 
•5693 •5707 ‘5721 

•5835 •5850 '5864 
•5976 ‘5990 '6004 
•6115 •6129 *6143 
•6252 *6266 *6280 
*6388 *6401 *6414 

•6521 -6534 *6547 
•6652 *6665 '6676 
•6782 *6794 *6807 
•6909 '6921 *6934 
•7034 *7046 ’7059 
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Table "i—Natural sines (continued) 


0 12 3 4 5 6 


45 *7071 •7083 *7096 *7108 *7120 *7133 ‘7145 

46 *7193 *7206 ♦7218 *7230 *7242 *7254 *7266 

47 *7314 *7325 *7337 •7349 *7361 *7373 *7385 

48 *7431 *7443 -7455 •7466 *7478 ^7490 ♦7501 

49 *7547 •7559 ’7570 •7581 ♦7593 *7604 ‘7615 

50 *7660 *7672 •7663 *7694 •7705 •7716 ♦7727 

51 •7771 •7782 •7793 •7804 •7815 *7826 ■7837 

52 •7880 ♦7891 *7902 -7912 •7923 *7954 *7944 

53 •7986 •7997 •8007 -8018 •8028 *8039 ^8049 

54 *8090 ‘BlOO -81M •8121 •8131 -8141 ♦8151 

55 ♦8192 ♦8202 •8211 •8221 •8231 •8241 '8251 

56 •8290 '8300 *8310 '8320 •8329 •8339 •8348 

57 •8387 •8396 •8406 ‘8415 *6425 *8434 *8443 

58 ^8480 *8490 •8499 '8508 *8517 •8526 *8536 

59 *8572 *8581 *8590 *8599 *8607 *8616 -8625 

60 *8660 ’8669 *8678 *8686 -8695 *8704 *8712 

61 *8746 *8755 *8763 *8771 *8780 *8788 *8796 

62 ♦8829 •8838 *8846 •8854 *8862 *8870 *8878 

63 ♦8910 •8918 •8926 *8934 *8942 *8949 *8957 

64 •8988 *8996 *9003 *9011 *9018 *9026 *9033 

65 *9063 *9070 *9078 *9085 *9092 *9100 *9107 

66 *9135 *9143 ’9150 *9157 *9164 *9171 *9178 

67 *9205 *9212 *9219 *9225 *9232 *9239 *9245 

68 ♦9272 *9270 *9285 *9291 *9298 *9304 *9311 

69 *9336 *9342 *9348 *9554 *9361 *9367 *9373 

70 *9397 *9403 *9409 *9415 *9421 *9426 *9432 

71 *9455 *9461 *9466 *9472 *9478 *9483 *9489 

72 *9511 *9516 *9521 *9527 *9532 *9537 *9542 

73 *9563 *9568 *9573 *9578 *9585 *9568 *9593 

74 *9613 ♦9617 *9622 *9627 *9632 *9636 *9641 

75 *9659 *9664 *9668 *9673 *9677 *9681 *9686 

76 '9703 *9707 *9711 *9715 *9720 *9724 *9720 

77 *9744 *9748 *9751 *9755 *9759 *9763 *9767 

78 *9781 *9785 *9789 *9792 *9796 *9799 *9805 

79 *9016 *9820 *9823 *9826 •9829 *9833 *9836 

80 *9848 *9851 *9854 *9857 *9860 ‘9063 *9866 

81 *9877 *9880 *9882 *9085 *9800 *9890 *9893 

82 *9903 *9905 *9907 *9910 *9912 *9914 *9917 

83 *9925 *9920 *9930 *9932 *9934 *9936 *9938 

84 *9945 *9947 *9949 *9951 *9952 *9954 *9956 

05 '9962 *9963 *9965 *9966 *9968 *9969 *9971 

86 *9976 *9977 *9970 •9979 '9980 *9981 *9902 

87 *9986 *9987 *9980 *9989 *9990 *9990 *9991 

88 *9994 *9995 *9995 *9996 *9996 *9997 -9997 

89 *9998 *9999 *9999 '9999 *9999 1*00 1*00 


7 8 9 123456 789 


7157 *7169 *7181 

1 

2 

4 

5 

t 

7 

9 

10 

1 f 
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Table 4—Normal or gaussian integral P(X) 



0 12 3 

4 5 6 

0*0 

OM 

0*2 

0*3 

0*4 

•5000 *5040 *5080 *5120 
•5398 *5438 '5478 ‘SSI? 
•5793 *5832 *5871 *5910 
•6179 *6217 *6255 •6293 
•6554 *6591 •6628 •6664 

•5160 •5199 *5239 
•5557 *5596 ’5636 
•5948 •5987 •6026 
•6331 •6368 '6406 
•6700 *6736 •6772 

0*3 

•69tS *6950 *6985 *7019 
•6932 *6967 *7002 *7037 

•7054 ‘7088 •7123' 
•7071 *7106 *7140 

0*6 

•7257 -7291 *7324 -7357 
•7274 *7307 *7340 ‘7373 

•7389 *7422 ^7454 
♦7405 •7438 *7470 

0*7 

•7560 ’7611 •7642 *7673 
•7596 *7627 •7658 •7688 

•7704 *7734 •7764 
•7719 *7749 •7779 

o*a 

•7881 *7910 *7939 -7967 
•7896 *7925 ‘7933 ‘7981 

•7995 ■8023 •8051 
•8009 •8037 •8065 

0*9 

•8159 •8166 ‘8212 '8238 
•8173 *6199 *8225 *8251 

•6264 *8289 *8315 
•8277 *8302 *8327 

1*0 

•8413 *8438 *8461 •8485 
•8425 '8449 •8473 *8497 

•6508 *8531 *8554 
•8520 *6543 '8566 

t»l 

•8643 •0665 •8666 *8708 
•8654 •6676 •8697 *8718 

•8729 •8749 •8770 
•8739 *8760 *8780 

1«2 

•8849 •8869 •8688 •8907 
•8859 -8878 •8897 *8916 

•8925 *8944 •8962 
■8934 *8953 *8971 

1*3 

■9032 *9049 *9066 •9082 
•9041 ‘9057 *9074 *9091 

•9099 *9115 •9131 
•9107 *9123 ‘9139 


1*4 

•9192 

*9200 

•9207 *9222 *9236 
•9215 *9229 *9244 

•9251 •9265 *9279 
•9258 '9272 -9285 

1*5 

*9332 

•933$ 

•9345 *9357 *9370 
•9351 *9364 ‘9376 

•9382 *9394 *9406 
•9388 '9400 *9412 

1*6 

*9432 

*9436 

•9463 *9474 *9484 
•9468 *9479 *9490 

•9495 *9505 *9515 
•9500 *9510 *9520 

1-7 

*9354 

•9559 

•9564 •9573 •9582 
■9568 *9577 *9586 

•9591 *9599 *9608 
•9595 *9604 *9612 

i«e 

•9641 

•9645 

•9649 *9656 *9664 
•9652 *9660 •9667 

•9671. '9678 •9686 
•9675 *9682 ‘9689 

1 •a 

*9713 

•9716 

•9719 *9726 *9732 
•9723 *9729 *9735 

•9736 •9744 *9750 
•9741 *9747 '9753 

2*0 

2*r 

2*2 

2*3 

2*4 

•9772 

.*9821 

•9861 

*9893 

*9918 

•9778 *9783 •9788 
■9026 ’9830 •9834 
•9864 *9868 *9871 
•9896 *9896 *9901 
•9920 *9922 '9925 

•9793 ‘9798 •9803 
•9838 *9842 *9846 
•9875 •9878 *9881 
•9904 *9906 *9909 
•9927 *9929 *9931 

2*5 

2*6 

2*7 

2*8 

2*9 

*9930 

•9953 

•9965 

*9974 

•99d1 

•9940 *9941 *9943 
•9955 *9956 '9957 
•9966 *9967 *9968 
•9975 *9976 *9977 
•9982 *9982 *9983 

•9945 *9946 *9948 
•9959 *9960 *9961 
•9969 *9970 *9971 
•9977 •9978 *9979 
•9984 ’9984 *9985 


a 1 2 3 4 5 6 7 8 9 


•5279 *5319 •5359 

4 

8 12 

16 20 

24 28 32 36 

•5675 *5714 *5753 

4 

8 12 

16 20 

24 28 32 36 

•6064 '6103 '6141 

4 

8 12 

15 19 

2 3 27 31 35 

•6443 *6480 '6517 

4 

$ * 

11 

15 19 

22 26 30 34 

•6806 '6844 *6879 

4 

7 

n 

14 18 

22 25 29 32 

•7157 *7190 ^7224 

3 

7 

10 

14 

(0*50^6) 

•7174 •7207 ^7241 

3 

7 

ID 

13 

tO'S7-9) 

•7486 *7517 *7549 

3 

7 

10 

13 

(0*60-6) 

•7502 *7533 •7565 

3 

6 

9 

15 

(0'67-9) 

•7794 '7823 ‘7852 

3 

6 

9 

12 


•7808 •7858 *7667 






•8078 '8106 '8133 

3 

6 

8 

11 

tO‘80-5) 

•8092 '8119 *8146 

3 

5 

a 

11 

10•66-9) 

•8340 *8365 '8389 

t 

S 

8 

to 


•8352 •8377 ^8401 

2 

5 

7 

10 

(0'97-9) 

•8577 *8599 •8621 

2 

3 

7 

9 


•8588 *8610 *8632 

2 

4 

7 

9 

(l*05-9) 

•6790 •8810 *8830 

2 

4 

8 

9 

(1'10-1) 

•8600 ‘8820 •6640 

2 

4 

6 

6 

(1*12-9J 

•8980 *8997 *9015 

2 

4 

6 

$ 


•8988 *9006 *9023 

2 

4 

5 

7 

(l*25-9) 

•9147 •9162 *9177 

2 

3 

5 

7 

(1*30-3) 

•9154 '9170 •9105 

2 

3 

3 

6 

(1*34-9) 

•9292 *9306 •9319 

1 

3 

4 

6 

(1*40-6) 

•9299 *9312 *9325 

1 

3 

4 

5 

(1*47-9/ 

•9418 ‘9429 ‘9441 

1 

3 

4 

5 

(1*50-7) 

•9424 '9435 *9446 

t 

2 

3 

5 

(1*58-9) 

•9525 •9535 *9545 

1 

2 

3 

4 


•9530 '9540 *9550 






•9616 '9625 '9633 

t 

2 

3 

4 

(1-70-4) 

•9620 *9629 *9637 

1 

2 

2 

3 

(1■75-9) 

•9693 *9699 '9706 

1 

2 

2 

3 

(1*80-1) 

•9696 ■9703 •9’10 

1 

1 

2 

3 

t1*82-9) 

•9756 •9761 *9767 

1 

t 

2 

3 

(1*90-1) 

•9759 *9764 *9770 

1 

1 

2 

2 

(1'92-9) 

1 

2 

3 

4 i 

,6 7 8 9 

•9808 ‘9812 *9817 

0 

“T 


2 1 

13 3 4 4 

•9650 *9854 *9857 

0 

1 

1 

2 : 

\ 2 3 3 4 

•9884 *9887 *9090 

0 

t 

1 

X : 

\ 2 2 3 3 

•9911 *9913 *9916 

0 

0 

1 

1 1 

It 2 2 2 

•9932 ‘9934 '9936 

0 

0 

1 

1 1 

[ 1 12 2 

•9949 *9951 *9952 

0 

0 

0 

1 1 

ft tit 

1 « « 4 4 

•9962 '9963 *9964 

0 

0 

0 

0 1 

11 111 

a « • 

•9972 •9973 *9974 

0 

0 

0 

0 U 1 • * ! 

•9979 *9980 ‘9981 

0 

0 

0 

0 0 0 oil 

•9985 *9986 *9986 

0 

0 

0 

0 0 0 0 0 1 


X 3*0 3M 3-2 5*3 3’4 3-5 3*6 3‘7 3'8 5'9 

PlX) ‘9987 *9990 '9993 '9995 *9997 *9998 '9998 '9999 *9999 t-0000 


Thit table « derived from Appendix Table 2 of G. U. Yule and M. G. Kendalls 
'•An Introduction to the Theory of Statisties" by kind permission of the authors 
and the publishers, Messrs. Charles Griffin & Co. Ltd. 
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Table 5—Significance points of 
the sample correlation 
coefficient r 


Correlation coefficient r 


SAMPLe 

DB3. OF 

•05 

•01 

NUHBS 

FR£nX»H 

POIKT 

POIKT 

n 

V 



s 

3 

•88 

•96 

6 

4 

*61 

•92 

7 

5 

•7S 

•87 

6 

6 

•71 

•63 

9 

7 

•67 

•60 

10 

8 

•63 

•76 

It 

9 

*60 

•73 

12 

10 

•58 

•71 

13 

11 

•55 

•68 

14 

12 

•53 

*66 

15 

13 

•51 

*64 

16 

14 

•50 

*62 

17 

15 

•46 

*61 

18 

16 

•47 

•59 

19 

17 

*46 

•56 

20 

18 

•44 

*56 

21 

19 

•43 

•55 

22 

20 

*42 

•54 

27 

25 

•38 

•49 

32 

30 

•35 

•45 

37 

35 

*32 

*42 

42 

40 

•30 

•39 

47 

45 

*29 

•37 

52 

50 

•27 

•35 

62 

60 

•25 

*32 

72 

70 

•23 

•30 

62 

60 

•22 

*26 

92 

90 

•2t 

• 27 

102 

100 

•19 

*25 


Table 6—Significance points of 
X* (chi-squared) 


Otl-squer«d 


DEG. OF -OS -01 

FREn»M POINT POINT 

V 


1 

3*64 

6*64 

2 

5 *99 

9'2I 

3 

7*02 

t t *34 

4 

9*49 

1 3*28 

5 

11 *07 

1 5*09 

6 

12*59 

16 *81 

7 

14*07 

18*48 

6 

15*51 

20*09 

9 

16*92 

2t -67 

10 

16*31 

23*21 

11 

19*66 

24*73 

12 

21 *03 

26-22 

1 3 

22*36 

27 *69 

14 

23-69 

29 *14 

15 

25*00 

30-56 

16 

26*30 

32-00 

17 

27-59 

33-41 

16 

26-87 

34-01 

19 

30 *14 

36 *19 

20 

31 '41 

37 * 57 

21 

32*67 

38-93 

22 

33*92 

40*29 

23 

35*17 

41 *64 

24 

36-42 

42 96 

25 

37 '65 

44-31 

26 

38-89 

45-64 

27 

40 * 1 1 

46-96 

26 

41 *34 

48 -26 

29 

42*56 

49 *59 

30 

43*77 

50-89 


Tabu 5 fi abridged from TubU l ^ of Siathtuof Mrthods /or 
by hind prrmisuon of the author. Prof. Sir P. A I ishfr, 

Mrssn. Oiivrr and lioyd. 


lUirarcft M'ttrhrrs** 
ond ih( puhliihtrt. 


Tabu 6 u abridg<d]tom TabU III of "Staihtuol S- 

by kind prmtission of ihe author. Prof. Str H. A. iuher, 

Mturt, O/lirr and Uoyd. 


lititaf<h Wvtheti** 
and (hr pubUshm, 
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Table 7— Significance points for the variance ratio F and for 

“Student’s” t (05 points) 



2 18*5 

3 10*1 

4 7*71 

5 6*61 

6 5*99 

7 5*59 

e 5*32 
9 5*12 


2 3 


200 216 
19*0 19*2 

9*S5 9*28 
6*94 6*59 

5*79 5*41 
5*14 4*76 
4*74 4*35 
4*46 4*07 

4*26 3*66 



10 

4*96 


11 

4*84 


12 

4*75 


13 

4*67 


14 

4*60 


15 

4*54 


16 

4*49 


17 

4*45 


18 

4*41 


19 

4*38 


20 

4*35 


21 

4*32 


22 

4*30 


23 

4*28 


24 

4*26 


25 

4*24 

120 

26 

4*22 

27 

4*21 

yi 

28 

4*20 


29 

4*18 

4I 

30 

4*17 

3 

40 

4*08 

2 

60 

4*00 

1 

120 

3*92 

0 

CO 

3*84 


4*10 

3*71 

3*98 

3*59 

3*88 

3*49 

3*80 

3*41 

3*74 

3*34 

3*68 

3*29 

3*63 

3*24 

3*59 

3*20 

3*55 

3*16 

3*52 

3*13 

3*49 

3*10 

3*47 

3*07 

3*44 

3*05 

3*42 

3*03 

3*40 

3*01 

3*38 

2*99 

3*37 

2*98 

3*35 

2*96 

3*34 

2*95 

3*33 

2*93 

3*32 

2*92 

3*23 

2*84 

3*15 

2*76 

3*07 

2*68 

2*99 

2*60 



4 


225 

19*3 

9*12 

6*39 

5*19 

4*53 

4*12 

3*84 

3*63 

3*48 

3*36 

3*26 

3*18 

3*11 

3*06 

3*01 

2*96 

2*93 

2*90 

2*87 

2*84 

2*82 

2*80 

2*78 

2*76 

2*74 

2*73 

2*71 

2*70 

2*69 

2*61 

2*52 

2*45 

2*37 


2A/V^ 

5 


230 

19*3 

9*01 

6*26 

5*05 

4*39 

3*97 

3*69 

3*48 

3*33 

3*20 

3*11 

3*02 

2*96 

2*90 

2*85 

2*8T 

2*77 

2*74 

2*71 

2*68 

2*66 

2*64 

2*62 

2*60 

2*59 

2*57 

2*56 

2*54 

2*53 

2*45 

2*37 

2*29 

2*21 


4 


6 


234 

19 * 3 . 

8*94 

6*16 

4*95 

4*28 

3*87 

3*58 

3*37 

3*22 

3*09 

3*00 

2*92 

2*85 

2*79 

2*74 

2*70 

2*66 

2*63 

2*60 

2*57 

2*55 

2*53 

2-51 


3*42 

2*34 

2*25 

2*17 

2*09 


8 


239 

19*4 

8*84 

6*04 

4*82 

4*15 

3*73 

3*44 

3*23 

3*07 

2*95 

2*65 

2*77 

2*70 

2*64 

2*59 

2*55 

2*51 

2*48 

2*45 

2*42 

2*40 

2*38 

2*36 


2*27 
2*18 
2*10 
2*02 
1 *94 




244 

19*4 

8*74 

5*91 

4*68 

4*00 

3*57 

3*28 

3*07 

2*91 

2*79 

2*69 

2*60 

2*53 

2*48 

2*42 

2*38 

2*34 

2*31 

2*28 

2*25 

2*23 

2*20 

2*18 

2*16 

2*15 

2*13 

2*12 

2*10 

2*09 
2*00 
1 *92 
1*83 
1 *75 


2*49 2*34 

2*47 • 2*32 
2*46. 2*30 
2*44 2*29 

2*43 2*28 


1 O 


24 

<0 


249 

254 

12*7 

19*5 

19*5 

4*30 

8*64 

8*53 

3*18 

5*77 

5*63 

2*78 

4*53 

4*36 

2*57 

3*84 

3*67 

2*45 

3*41 

3*23 

2*37 

3*12 

2*93 

2*31 

2*90 

2*71 

2*26 

2*74 

2*54 

2*23 

2*61 

2*40 

2*20 

2*50 

2*30 

2*18 

2*42 

2*21 

2*16 

2*35 

2*13 

2*15 

2*29 

2*07 

2*13 

2*24 

2*01 

2*12 

2*19 

1 *96 

2*11 

2'15 

1 *92 

2*10 

2*11 

1 *88 

2*09 

2*08 

1 *84 

2*09 

2*05 

1 *81 

2*08 

2*03 

1 '78 

2*07 

2*00 

1 *76 

2*07 

1 *98 

1 *73 

2*06 

1 *96 

1 *71 

2*06 

1 *95 

1 *69 

2*06 

1 *93 

1 *67 

2*05 

1 *91 

1*65 

2*05 

1*90 

1 *64 

2*05 

1 *89 

1*62 

2*04 

1 *79 

1 *51 

2*02 

1 *70 

1 *39 

2*00 

1*61 

1 *25 

1 *98 

1 *52 

1 *00 

1 *96 


Table 7 is derived from Tables III and V of "Statistical Tables for Biologteal. 
Agricultural and Medical Research" by kind permission of the authors. Prof. Sir 
R A. Fisher and Dr. F. Yates, and the publishers, Messrs. Oliver and Boyd. 
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Table 7 {continued )—Significance points for the variance ratio f 

and for “Student’s” t ( 01 points) 


\ 

0 

1 

2 

3 

y. 


4052 

4999 

5403 


2 

98*5 

99*0 

99-2 


3 

34M 

30*8 

29*5 


4 

21*2 

18*0 

16*7 


S 

16*3 

15*3 

12*1 


6 

13-7 

10*9 

9*78 


7 

12*3 

9*55 

8*45 


8 

11*3 

6*65 

7*59 


9 

10*6 

8*02 

6*99 


10 

10*0 

7*56 

6*55 


11 

9*65 

7-20 

6*22 


12 

9*53 

6*93 

5*95 


15 

9*07 

6*70 

5*74 


14 

8*86 

6*51 

5*56 


15 

8*68 

6*36 

5*42 


16 

0*53 

6*23 

5*29 


17 

6*40 

6*11 

5*18 


18 

8*28 

6*01 

5*09 


19 

8*18 

5*93 

5*01 


20 

8*10 

5*85 

4*94 


21 

8*02 

5*70 

4*87 


22 

7*94 

5*72 

4*82 


23 

7*88 

5*66 

4*76 


24 

7-82 

5*61 

4*72 


25 

7*77 

5*57 

4*68 

us. 

A * 

26 

7*72 

5*53 

4*64 

27 

7*68 

5*49 

4*60 

^2 

28 

7*64 

5*45 

4*57 


29 

7*60 

5*42 

4*54 

4 

30 

7*56 

5*39 

4*51 

3 

40 

7*31 

5*18 

4-31 

2 

60 

7.08 

4*98 

4*13 

1 

120 

6*85 

4*79 

3*95 

0 

<D 

6*64 

4*60 

3*78 


24/V, 4 

3 

AS 6 

8 


5625 

5764 

5859 

5981 

99-3 

99*3 

99*3 

99*4 

28*7 

28*2 

27*9 

27*5 

16*0 

15*5 

15*2 

14-8 

1 1 *4 

11 *0 

10*7 

10*3 

9*15 

8*75 

8*47 

8*10 

7*05 

7*46 

7*19 

6-84 

7*01 

6*63 

6*37 

6*03 

6*42 

6*06 

5*80 

5-47 

5*99 

5*64 

5*39 

5*06 

5*67 

5*32 

5*07 

4*74 

5*41 

5*06 

4*82 

4*50 

5*20 

4’66 

4*62 

4*30 

5*03 

4-69 

4*46 

4-14 

4*09 

4*56 

4*32 

4*00 

4*77 

4*44 

4*20 

3-89 

4*67 

4*34 

4*10 

3-79 

4*58 

4*25 

4*01 

3*71 

4*50 

4-17 

3*94 

3-63 

4 *43 

4*10 

3*87 

3-56 

4*37 

4*04 

3*81 

3*51 

4*31 

3*99 

3*76 

3*45 

4*26 

3*94 

3*71 

3*41 

4*22 

3*90 

3*67 

3-36 

4*18 

3*86 

3*63 

3*32 

4*14 

3*82 

3*59 

3-29 

4*1 1 

3*78 

3*56 

3*26 

4*07 

3*75 

3*53 

3*23 

4*04 

3*73 

3*50 

3*20 

4-02 

3-70 

3*47 

3-17 

3*83 

3-51 

3*29 

2*99 

3*65 

3*34 

3*12 

2*82 

3*40 

3*17 

2*96 

2 *66 

3-32 

3-02 

2*80 

2*51 


2 

1 

0 

t 

12 

24 

CD 


6106 

6234 

6366 

63-7 

99*4 

99*5 

99*5 

9*93 

27*1 

26 *6 

26 1 

5*84 

14-4 

13*9 

13-5 

4*60 

9*89 

9-47 

9*02 

4*03 

7-72 

7*31 

6*88 

3*71 

6 *47 

6*07 

5*65 

3*50 

5-67 

5*28 

4-66 

3*36 

5*1 1 

4-73 

4-31 

3*25 

4-7! 

4*33 

3*91 

3*17 

4-40 

4-02 

3-60 

3*11 

4*16 

3*78 

3-36 

3*06 

3*96 

3*^59 

3-16 

3*01 

3*60 

3-43 

3-00 

2*98 

3*67 

3-29 

2-87 

2-95 

3*55 

3-18 

2*75 

2-92 

3 *45 

3-08 

2*65 

2-90 

3*37 

3*00 

2*57 

2*88 

3*30 

2*92 

2-49 

2*86 

3*23 

2*86 

2-42 

2*85 

3*17 

2*60 

2-36 

2*63 

3-12 

2-75 

2*31 

2*82 

3*07 

2*70 

2*26 

2*81 

3*03 

2*66 

2-21 

2*60 

2*99 

2-62 

2*17 

2*79 

2*96 

2-50 

2*13 

2*78 

2*95 

2-55 

2-10 

2-77 

2*90 

2*52 

2-06 

2*76 

2*87 

2 *49 

2*03 

2*76 

2*84 

2-47 

2-01 

2*75 

2*66 

2-29 

1 *80 

2*70 

2*50 

2*12 

1 *60 

2*66 

2-34 

1 *95 

1 *38 

2*62 

2*18 

1 *79 

1 *00 

2*58 
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Table 8—Conversion from r to x = t anh ‘ r 

and conversely 


r 

z-r 

z 

•00-»*24 

*oo 

•OO-^24 

e25**34 

•OI 

•25-‘36 

♦35* *40 

•02 

•37-^43 

• 41-»45 

•03 

• 44— *48 

*46<*M8 

•04 

•49-•SS 

•49**51 

•05 

♦S4-^57 

*j2**54 

•06 

'58-*60 


•07 

•61-*63 


X • r ♦ (z-r) 


r 

z*r 

z 

•57-^5e 

•06 

.64-^67 

♦59-’60 

•09 

•68-^69 

•61-'62 

♦10 

•70-^72 

•63 

♦1 1 , 

•73-*75 

•64 

•12 

•76- *77 

•65-*66 

♦13 

•78- •79 

•67 

•14 

•80-*92 

•68 

•15 

•83-‘e4 


r • * - lz~r) 



ANSWERS TO PROBLEMS 

(in the text problems are numbered conseattively within each section) 


pp. 26-27 (Section 2.9) 

(1) (6), («), (e) and (/). r. . j . r 

(2) must have the same reduced value as the square of the reduced value ot 

A, i.e., as of o*. i*, 2^, . . . g* or 10*. 

(3) A^ must end in the same figure as does the square of the last figure of A. For 
li A = loB + b, where b is the last figure, A^ = 10 (loB* + zBb) 4 - 6*. 
62,500 = 250*; 41,616 = 204-; all the others are excluded by the results of 
this or the preceding problem, except for J2io, which cannot be a square 
because it is divisible by 2 but not by 2*. 

(4) 110,592 = 48\ 

(6) G.C.F. = 12; factors i, 2, 3, 4, 6, 12. 

(8) G.C.F. = 3; L.C.M. = 1440. 

p. 39 (Section 3.4) 

(1) Minimum 3 when .v = o. 

(2) Minimum 10 when .v 2. 

(3) Minimum 20 when x = 5,2. 

(4) Maximum 44-145 when .v =* 3. 


p. 40 

(5) * = ± V'37 == ± 6 083. 

(6) X = 2 d: VIS = — 1-873 5 ° 73 - 

(7) X = 2-5 ± % 5 = 2^4 ‘‘r 4-73<>. 

(8) ar = -145 or 5 855. 


p. 41 

(9) (x — 2) (x + 2). 

(10) (x + 2) (* 4 - 3). 

(11) None. 

(12) None. 

(13) None. 

(14) a( 29-43 — 4-90S.t). 


p. 58 (Section 3.9) . , , 

Query: at least one fraction would have a zero denominator, and would 
fore be illegitimate. 


there- 


p. 63 (Section 4.2) 

Query: No. For example s 3 4 > • 4) < (3 

p. 64 

liut + A A 4 n I . = 4 M -J’ O* | iA^ i- 4 -* 

and (A ~ a) > o. Hence the result follows. Also it A ^ —1, « = 2, 

A > a but A^ rt*. 

(4) x* 4- 4A- — (—4) =• (x 4 2)* > o; equality if x = —2. 

(5) If X is negative, x 4 l/.x < — 2. 

(6) The rule breaks down. 

(8) Equality if xF = A’y. v'/»i \ 

(io)*fl*'+^6» 4' c* — 3 ^hc = (a 4 & 4 (^* + ** */’ - ab - be ~ ca) 

= i fa 4 - 6 4 - c) r(fl - b)' 4 (6 - c)* + (f “ «)*]• 

687 
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p. 66 (Section 4.3) 

(4) No; e.g. A^2,a^ I, x ^ - 3 - 

(6) Yes. (Divide through by lAa\). 

(7) When * and y have the same sign or at least one is zero. 


p. 78 (Section 5.5) 

(5) 1 cosec 0 I > 1, 1 sec 0 I i., 

(6) tan 6 , sec $, cos 6 , — tan 0 , sin 0 . 

(7) sin 45* = cos 45'' = i/V* = ' 7 o 7 * 
tan 45° = cot 45® = I. 

sec 45® ==* cosec 45 ^ 2 — i*4i. 

(8) sin 30® = cos 60® = x/z. 
cosec 30® = sec 6o® = 2. 

cos 30° = sin 60*’ = \/ 3/2 — '866. 
sec 30® = cosec 60® = z/Vs = *'* 55 » 
tan 30® = cot 60® = i/V '3 == ‘ 577 - 
cot 30® = tan 60® = V 3 ” * ‘ 732 * 

(9) sin is“ = cos 75" = * 259 . 
cosec 15® = sec 75 = 3'8o. 
cos 15® = sin 75® = *966. 
sec 15® = cosec 75® = 1-035. 
tan IS® = cot 75® = •z68. 
cot 15° = tan 75 ® = 3 ‘ 73 - , 

(ti) i-zz km., 118 km., i *37 km. 

(12) -0875 m/sec, 6o*i watts. 

p. 89 (Section 5.11) 

(3) y = (10 — a:)/ 7 , y = — 2 -h 7 Jf- 


( 4 ) Unlrground */o -8 ->./o-3 = ■• British RttiKvays =‘ 1 °^ + * 

(5) if P'n>en- 

dicular if rr + = O. U • • 

(9) The perpendicular to the line from the origin. 

(i) k’ + 15)* = 6 z 5 . (7, —39) and (7, 9 ): (—24, —8) and (15, s): 
(—7, —39) and (15, 5 )- ^ 

(3) Centre (5. o), radius s; ( 3 . 9 ); ( 2 , 4 )- 

(4) Centre (—3. l O. radius ViSo-z. 

^1 ^ s= a* 4- 6 *; tan Z,QPR — 

p. 99 (Section 5.16) 

fi) F — (V7» o); F' = (— V 7 > o)- 

a*lm ± a6\/r6* —/* + 

(4) K, K' have .v co-ordinates = ■ 

and S has x-co-ordinate -a*/m/(6* + o'm). The corresponding y co¬ 
ordinates are given by y = / + 


p. X04 (Section 5.17) 

(,) _4_y* = i; (4- iVS. 0)i i\/5; ^ == ±2/V5- 

p. 123 (Section 6.9) 

(1) 160 calories. 

(2) 100 cm. 

p. 129 (Section 6.11) 

(1) About 7 km. 

(2) 3*3 X 10* metres; about 500 km. 
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p. 131 (Section 6.12) 

Query: M = i/ln 10, or in general i/ln (base). 

(1) ‘6931; f099; I •609. 

(2) 7-389; 2008; 2-202 X 10*; '1353; -0000454. 

p. 139 (Section 6.14) 

(i) 3-762; 3-627; -9640; 1-317; i‘ 444 - 

(.) J In 

(3) (*) i (y — 2), single-valued. 

(ii) ± \/(3’ — i), double-valued. 

(iii) In (i -f y). single-valued, y > — 1. 

(iv) (i —y)/(i + y). single-valued, y + — 1. 
(v) y = :tv(* —.>’*)» double-valued. 


pp. Z4S-146 (Section 7.2) 

(1) log « = 2-64 — -42/; « = 439 ^“'** 

(2) log r = -77 + -052/: r = 5-9^ **'• 

(3) log S = Toil + i log S = -10311 
U) A = io7e-“‘; 9 ' days. 


•41 litre. 


p. 149 (Section 7.3) 

(2) Probably monomolecular. 

( 3 ) y = 53 - 2 KX 8 - 3 ): « ^ 53 - 2 ; * » 

y = 6-29 K = 6 29; B = — •084- 

• 

p. 162 (Section 7.7) , 

(7) When Bi'.Ui + Bt V t = o the third scale would have to be placed jnhnitely 

far away. 


p. l6s (Section 8.2) 
Query: 8/3 m-p.h. 


p. 169 
(0 3 - 5 . 

(2) — I — i/r,r,. 

(3) - 3 (/i + /.)//!* 

(4) -2(1 -f /.)-•(! ^ ^. 

(s) (—1 + ^ + ft f \ ft ) (‘ + 

(6) I -1- li t /j 4 1 fift < ft"- 

( 7 ) (f, + fz)(* -fi^)-'(^ - 

p. 173 (Section 8.4) 

( 2 ) — 2I/(I -i /*)*; I 2 \ /: — I,2/V^ 


(I + li) 


-1 


p. 176 (Section 8.5) 

(1) Derivatives 24-96. 39 00. 56-16 from the formula. 



p. 181 (Section 8.7) 

This may be considered more conxincing because 
to the limit of sin 0 , 0 for small 0 . where U - h (/, 
a matter of opinion, since either proof depends on 


il makes a direct appe.d 
/,). lUn llt.it IN perh.ips 
geometnc.tl intuition. 


p. 185 (Section 8.9) 

(1) i2t 4- 2ir*. 

(2) 3\/t 4 - 1/2\^r. 

(3) 2l sin / 4- t* //cos t. 

(4) 2 (1 4 In t) sin / 1 2 / // In t cos t. 

(5) (* 4 J I” f)l Vf- 

p. 287 (.Section 8.10) 

(7) — 2tl(i + t*)*\ */(• 4 - f)*' 


YY 


— lit (In 0*; In /: — In ^ (/ In / -- t) *• 
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pp. X9Z-193 (Section 8.12) 

(1) 4 + 2 t. 

(2) *3 sin 0. 

(3) 33**5/^283 = I'970. 

(4) Dfx = 2/71(6* —**). 

(s) Dix — i kV {Fait). 

(6) u—9*8 i<. 

(7) ’050 cm/sec. 

(8) '1164 radians/sec = 6*668°/sec; 3'334*/sec. 

p. 196 (Section 8.14) 

(1) t + t It. 

(2) Dtx = V (1 + Diy = zxv (1 -f 4**) 


p. 198 (Section 8.15) 

(1) Take<^ (<) == elB. 

(2) Take ^ («) to be the smaller of the two numbers c/B, k. 

(3) When X —k <x <X + k, \ x —X \ < k == <f> («), therefore | y 

<€, |y ( < I y I + < = B. . ^ ^ 

(4) In Problem (3) replace y by Sy/Sx. This —^ dyjdx and is therefore bounded 
when X — k <x <X + k. The result follows from Problem (2). 


-y\ 


pp. 200..201 (Section 9.1) 

(1) Errors 008727 cos 6 , 008727 (sec 5 )*; proportional errors *008727 cot 0 , 
•008727 sec 0 coscc 0 . 

(2) ttRIioo cm®. 

(3) — *0048 radians — — * 275 “. 

(4) 4 per cent, 2^ per cent. 

(5) 45". 


p. 202 (Section 9.2) 

(x) ln.v; xly. 

(2) 2x,.v; —**/>>*. 

(3) —tl2\'{x + y); ~ i/2-s/(^ + y)' 


p. 206 (Section 9.3) 

(1) (xSx +yBy)lVix‘ + y*). 

(2) Sm/r® — 3mSi/s*. 


pp. 2X5«2 i 6 (Section 9.5) 

it) 8 M = - 2 K( 8 RIR^ + SL!L*). r . 

(2) tfx = [sec ix + >•)]* -f [sec (* —y)V; tcy = (see (* + y)] 

(3) tvx = — 4 xyV{x* —y*); = 4 x*y/{x* —y*)- 

p. 222 (Section 9.7) 

(1) Vt = 7r^®(4/? + Ai = Sn^i^t-'i* + at' »)/3. 

( 2 ) rx,y = xlV(x^ + .v=>: = ylVix* + y’): n = (13 — 

(3) \ 2 — 2^B.v/^*^+ y* + 4)”; /y = — zBylix* + y* + 4 )*; 

// = — 72B (i3< — 24o)/(i3X* — 480# + 4644)*. 


— [sec (*—y)]*. 


4S* + ^ 


p. 223 

(4) 7>,« = a 09 4- 2B)/mB09 — B)‘; T„,p = — aFln*Ri^ — P)». 
p. 230 (Section 10.2) 

(j) 3^ _j_ 4- C; -Jt* — cos t 4- C; 2t 4 - In t 4 - sin t + C. 

(2) (st 4 - it* 4 - C) mm. 

(3) iet 4- C. 



ANSWERS TO PROBLEMS 


691 


p. 234 (Section 10.3) 

(2) t + + el + C. 

(3) «» + + C. 

(4) — cos t — i cos zt — i cos 3/ + C. 

(5) — i/< + 3 In ^ + 3 * + + C. 

(6) 2 tan“* t — 2t + C. 

(7) 9 sinh~‘ (iO —4 cosh“* (if) + C. 

(8) — cos t + sin f — In (± cos f) + C. 

(9) i (cosh 3/ + sinh 3/) + C = ie®* + C. 
(to) i (cosh 3/ —sinh 3f) + C = ie“** + C. 

(11) f + if* + if® + it* + C. 

(12) < — if* + if® — it* + if* + C. 


p. 238 (Section 10.4) 

(1) 2ev^< + C. 

(2) sin (f + 3) + 

( 3 ) ~Vii -t^) + C. 

(4) — i cos (2f* + 3) + C. 

(5) — i (In cos f)* + c. 

(6) -i(cosf)^ + C. 

(7) (1 + e‘) In (i + e') — * — e' + C. 

(8) i(2f + 3 )*'® + C. 

(9) — i / (i + sin f)* + C. , ^ 

(10) (f + i) log (4f + 2) — (f + i) '-V/ 4 - C. 

(11) tanh“‘ sin x + C. 

(12) V (I + <*) + C- 
O3) In (i + el) + C. 

(14) I (i + In f)*'* + C. 

(15) i (sin f)* + C. 

(16) i [In (I + f®)]®'* + C. 

p. 241 (Section 10.5) 

(1) f sin f + cos f 4- C. 

(2) i f’ In f — i f» 4 - C. 

O) (<* — 2f 4- 2) e‘ 4 - C. 

(4) (2 -- f*) cos f 4- 2f sin f 4 - C. f /, 

(5) i< V(» 4 - t*) + i s‘nh-' f 4 - C. (A first integration by parts gives J t- </f 

f V(l 4 - f») * f (« 4 - second integral can be uritten 

J*(i 4- f*)(i H- 't^C. This 

gives an equation for J v dt.] 

(6) From Problem (i). f 0 co^ 0 dO = O sin 0 4 cos 0 + C. Put s.n ty = /. 

this becomes J sin'* I dl -= t sin'' / + cos (sm ' f) 4 - C. bo the term 
^ (i _/*)i must be taken to be cos (sin"' f). 


p. 2 S 3 (Section 10.9) 

(1) (A).y = t/V(C -2/). 

(2) (A), y = sin-‘ (f 4 - C). 

(3) (A). >. = Vfi -I- CO* - I)]. 

( 4 ) (B). >- - V(Cf -/*). 

(5) (A), >- » 4 /(C - f). 

(6) (A), >- = cos-'(Ccosec f). 

(7) (C), 2(y + i)/(f 4 - 1) + tan In [C(y 4 1) 4 - CO 4 t) J 

(8) (F), y = f* 4 - Ctei. 

(9) (H), y = f 4 - e>*i*/(C — W). 

(10) (I) or (A), y = sin-' (C cosec f). 


p. 256 (Section 10.10) 


(I) y 
( 4 ) y 


9 Oft)- 
9 O' - «•). 
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p. 259 (Section ii.x) 

(0 3 / 2 - 

(2) 19/4. 

b) 

U) I- 

b) o. 

p. 264 (Section ii.z) 

(1) i \/ 32 . 

(2) c — I. 

p. 267 (Section 11.4) 

( 1 ) JCr = z -h 3f; ^7 = 23 * 

b) S == II, A'r = — 6 4- im 

p. 269 

(3) n (zn — i). 

p. 276 (Section 11.6) 

(2) (3 + 2«). 

(3) 3r‘ —r; n* + n». 

p. 284 (Section 11.9) 

(3) cos“> (a/R) — a V(R* - «*)• ^ 

(4) Z.*cos-> ^ cos-* (1 ~^) --Ri. V (’ = 

7^2 ^ 20 COS 26 —sin 20 ] where L — zR cos 0 \ this = ^R* when 

8 s ’952 radians, L — i*i6oi?. 

(6) RH {0 + 2). 

(7) ^^(^2* -Ri") VU + r>/(R2 -Ri)*]- 

p. 293 (Section 11.12) 

(2) i 7 T L^. 

pp. 295-296 (Section it. 13) 

(1) 1500 cm*. 

(2) 3*927 X to-^cc. 

(3) 4*4 X lO*. 

(4) 9-5 X 10*. 

p. 302 (Section 11.15) 

(1) 2. 

P* 307 (Section 11.17) 

(0 

b) e~*. 
b) i"* 

p. 312 (Section 11.18) 

( 1 ) «-*•*? — 

(2) o. 

b) o- 

(4) J sin I. 

p. 314 (Section 12.1) 

(1) 2. 

(2) 2//®. 

(3) — tit*. 

(4) (61* — 2)/(i + t*y. 
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p. 316 

(5) yt = sinh /, yn — cosh t, yui = sinh t, and so on alternately. 

(6) yt — — K sin /C/, yu = — cos Kt, ym = sin Kt, ynu = K* cos Kt, 
and so on. 

(7) yi = B + zCt + er, yit = zC + ei, all higher derivatives = e'. 

(8) iD,)t’y = (— i)" \n — 1 (i + O"". 

pp. 322-323 (Section 12.2) 

(1) Width = mice depth. 

(2) t — z ± i/V 3 - 

(3) t = «• 

(4) o; 

is) i: 1- 

(6) Two metres from strongest lamp. 

(7) Radius of circle = height of rectangle = 6/(4 + n) metres = *840 m. 

(8) 100 cm. 

(9) Square 8/(w -f 4) = fizom; circle 2?r/(fl- + 4) = •880 m. 


p. 325 (Section 12.4) 

(1) — sin t + zt. 

(2) Cs + C^t + ir* + 

P- 327 (Section 12.5) 

(x) I *1; I ' 105 . 

(2) 1-05; 1-04875. 

P* 332 (Section 12.7) 

( 1 ) r — r*/2 4- r’/s — rv4 + etc. 

(2) M-HT — T^lz 4- r’/3 — r*;4 ^ etc.> 

(3) T — T’/b + T'Vls — '^■’-‘17 4- etc. 

(4) I — r*/l2 + rvk- etc. 

(5) I 4- iT — i 4 - hT^ — 1 048809. 


(1) 

(2) 

(3) 


p. 333 (Section 12.8) 
t ^ — J ; nunimum; 
/ Oj minimum; 


/ = — I, neither maximum nor minimum. 
/ sr maximum* 


At beginning of range, minimum if first non-zero dcrivati%c > 
if first non-zero derivative < o. At end of range, vice versa. 


o. maximum 


P* 337 (Section 12.11) 

(1) < 4- « — if* — Jf*“ — 

(2) I 4 - if 4 - fu — if*. 

p. 346 (Section 13.i) 

(1) i; -i; x/io"-‘. 

(2) 3 6; 5 ; 5 ,, 

(3) 32; -ii -^r (-i)"- 



pp. 346-347 (Section 13.2) 

(1) 5 X 2" animals; (z" —i)X to* square metres; 10.230,000 square metres 

(2) i? [i -( - *H- 

p. 348 (Section 13.3) 

(1) zrlii — r)«. 

(2) 6r/(i -r)\ 

p. 350 (Section 13.6) 

Query; only the scries o f o f o -i ... with sum o. 
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p. 355 (Section 13.7) 

(1) (— — a. 

(2) — I. 

(3) (— i)”"' 12m — a . 

(4) /*"*~*/| 2m— !♦ 

(5) t^"*~V |2m — 2. 

(6) 

(7) mf”*"*. 

p. 366 (Section 13.11) 

(1) The Binomial Series. 

P* 377 (Section 14.6) 

(1) If a, b, c, d are the four vectors from the origin O to the vertices of the 
tetrahedron, the vector OG is J (a 6 + e + <#). 

p. 378 (Section 14.7) 

(i) Replace w in (14.11) by (—w). 

p. 382 (Section 14.12) 

(i) {i. o), (i, 2Jr;n}, {i, 47 r/«}i . . • , {i, ( 4 « —2) ”ln)- 
P- 383 (Section 14.13) 

(7) ——«». — "*• , , 

( 8 ) {1, 77/3} and {1, 4 W 3 }* 

(9) Wr, i^} and {Vr, \0 + »}. 

p. 388 (Section 14.17) 

(1) 10 + 4J. 

(2) —2. 

( 3 ) 

(4) 3 — 2*- 

p. 389 (Section 14.18) 

(14) The vector z I is reflected in the x-axis. 

P‘ 393 (Section 1419) 

(3) i In 2 + (ar + J) m. 

(4) I + (2r + i)iri. 

p. 393 (Section 14.20) 

(1) cos I 4- » sin 1 ss *54 + •841. 

(2) — *• 

(3) 

P- 394 

Query: Ln exp z = z + 2nm where « is an integer, 
p. 396 (Section 14.21) 

(4) o- 

(s) d: {V2. = ± (1 + *)• 

(6) Three roots, except when z = o; ^ 0 }, + |w}, 

p. 398 (Section 14-23) 

(i) cosh (x + iy) — cosh x cos >• + 1 sinh x sin y; sinh (.v 4 - iy) =* sinh x X 
cos y + i cosh x sin y. 
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P- 399 

(3) I + it (e~^ — e) sin 1 ’83 — '99* ; 

(ei^ + e~i”) sin i — e~i^) i cos i ^ i '20 — 2*02 i. 

(4) cot z = 4- sec z = 2/(e^e + 

2 il(e-li —. eii). 

(5) cot z = i coth iz', sec z — sech iz\ cosec z = i cosech iz. 

p. 406 (Section 15.1) 

(2) ie^ (cos 2X + 2 sin 2x) + C. 

p. 408 

(3) (* — X*) cos X + 2x sin X 4 - C. 

(4) — cos 2x + i sin 2x + C. 

(5) 4 e* (x cos X + X sin x — sin x) + C. 


p. 4x4 (Section 15.2) 

(1) 4 —2ff-'($in X 4 - 4 sin 3x + 4 sin • • •)• 

(2) cos X + *3 cos 2x + *52 sin 2x + '2 sin 3X. 

p. 420 (Section 15.3) 

(1) i/(x — i) + i/(* + i)- 

(2) 5/2(x — i) — 7 /(* —2) 4 - i:/2(x —3). 

( 3 ) 3 /(* 4- i) — */(* — O + 2/(-v + 3). 

(4) s/6 (x — i) 4 - 2/3(x 4 - 2) — 3/2 (x 4 - 5 )- 


(0 

( 2 ) 

(3) 

(4) 

( 5 ) 


p. 424 (Section 15-4) 

— 4 In (± x) — i/2x 4- i In (± (a- 4 - 2)] 4 - C. 

In (± (I -X)] 4 - 2/(1 -X) 4 - C. o 

4 In r± (x — 2)] — s /3 (x — 2) — J In (± (x + 1)] 4 - C. 

- 4 In [± (X 4 - 1)] - 1/3 (X 4 - I) 4 - 4 In [± (x 4 - 4)1 + C. 

In [± (x 4 - 1)1 4 - 3/(x 4 - i) — 3 (x + 1)* — i /3 (v 4 i)* 4 - C. 


p. 428 (Section 15.5) 

(1) i i l.n (x 4 - i) — 4 » I-n (x — i) 4 * C. 

(2) 4 Ln (x* — tt>*) 4- C. 

(3) i Ln(x* 4 - I) 4 - C. 

p. 452 (Section 16.14) 

( 1 ) (g. 4). 

(2) (1-4, i-i, -4). 

p. 491 (Section 17*6) 

<*)x = 3 .>'“—«- 

(2) X = 3. >- = 2. z = I. 

(3) X = -875, y = — * 5 . 2 = ‘^ 75 - 

p. 496 (Section 17.8) 

(1) Unique solution. 

(2) Unique solution. 

(3) Redundant, 

(4) Unique solution. 

(s) Inconsistent. 

p. 500 (Section i 7 - 9 ) 

(2) 6; I; I. 

0) o; *8. 

P* 505 (Section 17.15) 

(i) o; 3 (<y* — w). 


cosec z = 
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p. 509 (Section 17.18) 

(i) The minors are 



respectively. For the co-factors reverse the signs of the numbers marked with 
asterisks. 

p. 510 

(3) This is equal to a determinant with a b column instead of a’s, i.e. with two 
equal columns, and therefore zero. 

(4) This is equal to a determinant with two equal rows. Replace **column*’ by 
“row” in (3). 

p. 522 (Section 18.6) 

(2) a» -1- a-b -I- aba -f i>o» -j- ab* -|- bab -j- b*a + 8. 

(3) 2 (ba —ab). 

(4) 2 (a'b — aba bo* — b*). 

PP* 53**532 (Section 18.13) 

(1) Latent root Aj = 5, column [i, 2]', row [i, ij; latent root A, = — 1, col¬ 
umn [i, — 1}', row (2, — t], 

(2) A, = L u(,) = [o, k, o]; A. = 1, 0(j) = [2I, I + m, zm]. 

(3) The latent roots are still S|, 8 g, . . . 8 n, but there are other vectors. 

p. SSI (Section 19.8) 

(■> t. A BO (= B) 

00 (= O). If Doreen is O, Bill has chances i AiO (= Ai), i BO 
B). Bessie is A^O. 

(2) child has chances i AjAt -h itAjA^ -t- iAiO (i.e. iAt), iAtB. 

^ i so ( — S). If Herbert is the second child has chances 

Mi- 4 * + \A^Ai + ^AiO (i.e. Mi). kA.B, iA^B, i BO (= B). 

P- 554 (Section 19.9) 

(1) 56*25 per cent O, 10*99 per cent B, 4 59 per cent As, 25*65 per cent Ai, 
2*10 per cent AiB, *42 per cent AsB. 

(2) s; p. 

(3) (0 (J + /> 9 )/(» + 9); (ii) (1 + pq)l{i 4 - q)', (hi) 1/(1 + q). The theoretical 

proportions are (o) i—<?*=*13S; (*) (i +/>7)/(i + ?) = '55*; (c) 

*/(* + 9) — *5*8; and the obser>*ed proportions are 120/900 = ‘iss, 
56/100 = *560 and 19/41 = *463 respectively, with good agreement. 

pp. 560-561 (Section 19.12) 

(1) 24; 720; 420; 840; 34650. 

(2) 252 X 3^/4“' — • 0583 - 

(3) If the frequency of M is p = -545, Wendy has a chance p/(2 — p) = *375 
of being iV/AT, and 2(1 —P)/(2 —P) = *625 of being ^JV. 

p. 597 (Section 20.11) 

(*) • 925 - 
(2) *5*2. 

p. 624 (Section 21.4) 

(*) X* — '92, with 3 d. f., good agreement. 

(2) X* = *5*4*. with 3 d. f., highly significant. 

p. 642 (Section 21.8) 

(1) Solve 0 (z + d)/(x — 0 )* = .xTvIyz; estimate = *0356. 

(2) 6 — = *672, with standard error X 113/344*) = *025. 
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(1) li’; 3^; 5+-; 35t; 3€; z€if; tzil; it^; iCC; 21222. 

(2) 16; 48; 7; 71; 163; 68; 256; 2998. 

p. 663 (Section 22.3) 

(1) 102C; €2; it; f'32. 

(2) —22; —26; —234; —18. 

p. 665 (Section 22.4) 

(1) 1032. 

(2) 4413. 

(3) iz£f4. 

p. 667 (Section 22.6) 

(i) 2002; 2002; 324; 202t’: 12223c. 

p. 668 (Section 22.7) 

(1) III; 31; 12C. 

(2) Quotient 4C2, remainder 22, 


p. 669 (Section 22.8) 

(1) 'll recurring (i.c. "iiilil . . .); •i43l*’C recurring; •i2C{23 recurring; 

•03c recurring. In the first three the last half of recurring period is simply 
the first half with reversed sign. 

(2) •3l't'Ci4 recurring; •43lf'Ci recurring, and so on. 'I'hese can all be ob¬ 
tained from the decimal for i,iC (= j 7) by taking digits from the front of 
the recurring part and adding them to the end. 


p. 671 (Section 22.10) 

(1) If a whole number « ends with the figure m, then tr and ni^ end with the 

same figure (see Problem 3, Section 2.9). Thus when ti ends with o, ends 

with o; when « ends with 1 or I, n* ends w ith i, and so on: in no case does 
a square end with 2. 2, 3, C. 

(2) If the number ti ends with m, then anti in ' eml with the same figure. Bv 
trying all values for the digit in, we find that if « ends w ith o, 1, |. 4. 

then so also does «’; if n ends with 2, 2. 3. or f its cube end,-, with 2, 2, C, 3 
respectively. 

(3) Numbers ending with o. 2, z, 4, are divisible by 2, and those ending with 

Si ^i by 5, and so are not prime. 

(4) I 2 C 

t- o 4 
3 z I 
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A-B-O blood-groups, 547. 55 i. 553-41 
gene frequency of, 554 
abscissa^ 69 

absolute magnitude or value (modulus) of 
complex number, 383-4; of real num¬ 
ber, 64; of vector, 377 
absolute maximum or minimum, 43, 317 
absolute temperature, 161,464 
acceleration, 313-16, 319-26; as deriva¬ 
tive, 315; of electric charge, 474. 478 
accuracy, 4-6, 13, 333-4 
Achilles and the tortoise, 350 
acre, 439 

acuity of vision, 128 
adaptation, 43, 44 

addition, by means of logarithms, 115-17; 
nomogram, 151-3. 162; punched 

cards, 23; slide rule, 150; in Colson 
notation, 664-5 

addition of complex numbers, 384-6; 
definite integrals, 308; derivatives, 
181-2; determinants, 500-1: frac¬ 
tions, 11-12, 58; indefinite integrals, 
229; logarithms, 107-8; matrices, 
515-16; polynomials, 30-1: proba¬ 
bilities, 543-4; series, 363, 403 i 
gonometric functions, 70, 80, 83; vec¬ 
tors, 374-6 

adult, height of, iiS, 592-4 
air-cells in lungs, 295 
Aiiken, A. C., 523. 537 
algebra, 28-49: in geometry, 68-83: 
laws of, 30-32, 57-8, 373-6. 379 - 86 , 
515-22, 525-8 
algebraic factors, 40, 41, 44 
algebraic fractions, 55-8; graph of. 56; 
integral of, 415-31 ; in partial fractions. 
4 »S- 3 t 

alignment chart, see nomogram 

allomctric relations, 120-3 

almost impossible, 542 

ampere, 442. 470 .or 

amplitude of complex number, 383; 

harmonic motion, 409; of radiation, 479 
analyser, differential, 18 
analysis (function theory), 364 
analysis of variance, 574, 628-32, 676 
analytical geometry, 68-83 
Anderson, E., 620 
angle, antilogarithm of, 106, 129; 

lar measure of, 125; eccentric, 95: 
hyperbolic. 135: in triangle, 69; 
measurement of, 70; radian unit of, 
125; subtended by arc, 134: trigono¬ 
metric functions of, 71 -83 


Angstrom, 479 
anthrax spores, 142-4 
anti-derivative, see indefinite integral 
antilogarithm, 106, 129: derivative of, 
178-80. 186; series for, 353, 355; see 
also logarithm, exponential 
antiseptic, 142-5; estimation of, 479-80 
antisymmetric matrix, 517 
antitoxin, 467 
apothecary’s units, 441 
applied mathematics, i 
approximate equality, 127; see approxi¬ 
mation 

approximation, error in, 216-18, 333 - 4 i 
general, 32‘>-32. 335 - 7 : order of. 

328-9; successive, in solution of equa¬ 
tions, 483-6. 491-3. 5 «o-i 3 : to an 
algebraic fraction, 57: to a definite 
integral or area. 302-5; to a factorial. 
565 -6; to a logarithm or antilogarilhrn, 

1 30; to a probability, 561 - 3 ; to a poly¬ 
nomial, 37 

Arabic numerals, 660-2 
arbitraiy constant, 226-9, 245-6 
arbitral^' functiivn, 254 
arc, circular, 124-6, 293 


are, 439 _ , r • , 

area, principal formulas, 293; or circle, 

280. 293. 305: of rone. 2S3. 293: ot 
curved strip, 279'. of cylinder, 281, 
291, 293; of cllip'e, 264, 293: of frus¬ 
tum. 284; of loops, 278. 296-300, 
304-5; of p.irallelograni, 2S4. 293: ot 
polygon, 301 2; <»f pri'-in. 281-2, 293; 
of secior,'280-1, 293: of sphere, 283-4. 

; of splicroid. iriJin^lc, 

284, 293* 302; under p.irahnlii, 261, 
274 5; under recUmcuLir liyperbola, 

263 

arca» unit> uful conver^'H^n lactor^, 430 * 
and ueitfht, 146: nuineric.il evaluation 


(Simpson’s rule, ctc.>, 302-^; as a 
<lerinite inlci^ral, 260-2, 26<^, 275# 

278-X4, 2<)0 ^00; l)v summation, 

264 6, ’26.,. 274 -- 5 : diu.mcc c\prc-ed 
a.s, 262: loe-irithin expressed as, 2(13: 
probabilnv expressed a-. 583 4, 594 * 5 1 
sign convention lor, 296-300; ot air- 
cells in lungs, 295: of bacillus, 21)4 ; ot 


vvounvl. 146 

aridiinetie. 1, 9-28: checking of, 19-22: 
division m. 15 multiplication in. 

16. 18 ; series or progression-., 53; in 
Colson notation. 664 8 
arilhnietico'geoinetric series. 347 
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arrangements, number of, 558-9 
array, see matrix, determinant, vector 
artery, flow in branched, 321 
ascending factorial, 357 *^ 3^5 

ascorbic acid, 480 
assay, 479—80 

association, test for, 620-1, 623-8 
asymptote, 102-4 
atmosphere, 455 
atom, size of, 142 
atomic weight, 463-4 
Australia, 43 

average rate of growth, 168-9; velocity, 
164, 169, 172-3; ree a/so mean 
averages, “ law of ”, 545 
axiom, in geometry, 69 
axis, major or minor, 95, 102; of co¬ 
ordinates, 35, 69; of ellipse, 95, 105; 
of hyperbola, 102, 105; of parabola, 

9J 

bacillus, area of, 294 

bacteria, growth of, 119, 134, 2 Z 4 -S» 
242-6; metabolism in, 211, 294; sur¬ 
vival of, 142-5; sulphur in, 296; vol¬ 
ume of, 294 

bacteriophage, size of, 141 
Baggally, 57 

baldness, premature, 554-5 

balloon, 2:9-23 

bar (unit of pressure), 455 

base, of logarithms, 112, 124 

bees, honeycombs of, 323 

Benedict, 123 

Bcrkson, J., 480 

Bernouilli di^erential equation, 251 
Bessel's function, 367 
beta integral, 567, 568 
bias, 637 

Bickley, W. G., 304 

binomial coeflFicient (reduced factorial), 
357-61; distribution, 578-80, 595-7, 
616; probabilities, 557-63, 578-9; 

series, 355-60, 403 
bio-assay, 479-80 
birth-weight, 581, 653-5 
blood, flow of, 321; cell, 201; colour- 
index, 161 

blood-groups, A-B-O, 547, 551, 553 - 4 ; 
M-N, 368, 546-53. 561. 613-14, 642; 
Lewis, 643-s; Lutheran, 632-4; com¬ 
parison of Britain and Latvia, 368 
Bomstein, 467 
bound, 55, 67 
Boyle’s law, 55-6 
bracket symbol, 258 
branches, of function, 138 
Briggs, and logarithms, 110-11 
brightness, 479 
British units, 160. 438-44, 59a 
Brodetsky, S., 153 
Brodie, 458 

Bromwich, T. J. I'A., 274 


Bruun, A. Fr., 118 
Bum, J. H., 480 

calculating machine, 14—18 
calculation, see evaluation 
calculus, differential, 164; history of, 
198; see also derivative, partial deriv¬ 
ative, differential equation, integral, 
indeflnite integral, definite integral 
calendar, 436 
Calkins, 36 
Callender, S. T., 632 
calorie, 155, 214, 459, 471 
calorific value of diet, 155 
Cambridge, 166 
Camburgh, 517-20 

capacity, electric, 475-6; see also volume 
carbohydrate, 155 

cards, playing, 540, 542-4; punched, 
22-3 

Carrel, 146 

cartesian co-ordinates, 69-72, 207-11 
casein, 149 

casting out nines and elevens, 20-21 
catenary, 132-3 
cell, blood, 201 
Centigrade scale, 34, 464 
centre of mass (of gravity), 45 J- 4 » 576 
centroid of triangle, 376, 454 
C.G.S. units, 44*. ^2. 460, 464, 466 
chance, see probability 
change {see also small changes, change of 
variable), and movement, 163; bracket 
symbol for, 258; delta symbol for, 
166; continuous, 163; discontinuous, 
163; of unit, 442, 443, 460-3, 590-2; 
rate of, 163—98 

change of variable, in distribution, 590 -;i> 
602; in derivative, 187—9;^ in partial 
derivative, 209-10, 53®; in definite 
integral, 307-12; in indefinite integral, 
234-8; in multiple integral, 539; in 
matrix form, 538 

characteristic, of logarithm, 112—13; 
equation of matrix, 532, 5341 root, 
value, vector, see latent root, latent 

vector . . 

checking, of arithmetic, 19—22; of solu¬ 
tion of differential equation, 245 
chemical reaction, 57, 148, i 49 . 3 **. 
464—8 

Chick, Dame H., 142-5 
chicken embryo, 234 
Chinese discovery of calculus, 198 
X* (chi-squared), 623, 628, 676, 683 
circle, area of, 280, 293, 305; equation of, 

93 . 

circuit, electric, 475 ”® 
circular arc, 124 . 

circular function, see trigonometric func¬ 
tion 

circumference of circle, 125, 293 
Clark, 145 
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codeviance, 599-600 

coding of data, 22-4. 585-7. 590-2. 604-5 
coefficient, of correlation, see correlation; 

of regression, 647-8 
cofactor, 509-10, 526 
coil, 474-6 

coin tossing, 540-«. 544. 561-2, 569-70, 

576. 578, 596 

colour index of blood, i6t 
Colson, J., 4, 17, 660-1 
Colson notation, 4, 17. 104, 660-71; 
arithmetic in, 664-8; decimals in, 
663, 669; logarithms in, 671; nega¬ 
tive numbers in, 662-3; tables in, 
670-1 ; time in, 661 

column(s), equal, 504; expansion by, 
510; interchange of, 502-4; total, 369, 
597. 600 

combinations, see arrangements 
combined estimate, 623 
common denominator, 12, 58 
conrunon factor, 27 

common logarithm, iio-ii, 673-5, 678-9 

common multiple, 27-8 

comparison, of complex numbers, 386; 

of magnitudes, 62-7, logarithmic, 118 
completing the stjuare, 37-8, 341-3, 434, 

654 

complex number, 379-406, 424-9; defin¬ 
ition of, 379; as matrix, 523-4: addi¬ 
tion of, 384-6; amplitude of, 384: 
conjugate of, 388-90; cube root of 1, 
382; derivative of function of, 401-2; 
distributive law for, 385-6; division of, 
383; exponential of, 393 - 5 . 397 - 40 °. 
402: hyperbolic functions of, 398; 
imaginary, 382, 386-8; integration of 
function of, 404-5; limits of function 
of, 401; logarithm of, 390-3, 402; 
magnitude or modulus of, 383-41 
multiplication of, 379-81; powers of, 
381, 396-8; real and imaginary parts 
of, 386-8; real number as, 379, 381-2; 
reciprocal of, 383 ; roots of, 382, 395 - 6 ; 
subtraction of, 384-6; Taylor scries of 
function of, 364. 403-5; trigonometric 
function of, 398-403 ; vector represen¬ 
tation of, 383; 2cro, 381 
component of vector, 371-2 
composite number, 10 
computation, see numerical evaluation 
Comrie, L. J., 55, 116 
condenser, 472, 475-6 
conditional probability, 549-50 
conductivity, electric, 147; heat, 465 
cone, area of, 283, 293 ; volume of, 286-7, 
294; double, 100 
conic sections, 99-101, 104, 105 
conjugate complex, 388-90, 428 
constant, 29; dielectric, 420, 472; of 
integration, 226-9; 245—6; of dissoci¬ 
ation, 468-9; of nature, 462 
constraint, 219-25 
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continuous distributions, 580-92, 603-7 
contour line, 49 

convergence, 13, 350-2, 403, 576; geo¬ 
metric, 351-2, 370, 403; tests for, 352 
conversion factors, for area, 439; energy, 
force, power, 443-4; heat, 460; length, 
439; mass, 440-1; pressure, 455; 
temperature, 464; velocity, 441; vol¬ 
ume, 440 

cooling, rate of, 242, 244, 410 
co-ordinates, cartesian, 48, 69—72, 207- 
II: polar, 70-2, 207-ti; origin of, 35, 

69 

Cope-Chat, 22-3 

corrected sum, of squares (deviance), 
577; of products (codeviance), 599- 
600 

correction, for error, 200; for mean, 577, 
600; Sheppard’s, 587, 605 
correlation, 602 el seq.-, interpretation of, 
609-10: matrix, 613; range of values 
of, 602-3, 612; sample value. 603-5, 

609- 10, 612-13, 620. 622-3; signifi¬ 
cance of, 675-6, 683; standard error 
of, 619-20: true, 603, 6o8-<): c-trans- 
formation of, 620. 622. 677, 686 

cosecant, cosec, cosech, see trigonometric 
functions or hyperbolic functions 
cosine, cos, cosh, series, 355, 403: see 
trigonometric functions or h\pcrbolic 
functions 
coulomb, 470 
cousin marriage, 555-6 
covariance, sample value, 599. 605, 608, 

610- 11, 614, 619, 656-7; true value, 
603, 606. 608, 612, 614; matrix, 613, 
656-7: multinomial, 614: standard 
error of, 619 

crude sum, of squares, 577 8. 587, 629; 

of products, 600 
cube root of 1, 382 
cubic ei|uation, 481-3 
cubic polynomials, 41-4 
Cullwick, E. G., 469 
cumulative distribution, 570-1. 580-1 
current, 469-76 

curve, isocritic, 654: length of, 288 9: 

motion along, 194 6; tangent to, 174 
curved strip, area of, 279 
cylinder, area of, 281, 291, 293; volume 
of, 285-6. 291, 294; section of. 96-8 

D = derivative, 175; dioptre, 479 
dachshund, 44 
damped oscillation, 478 
day. 436-8 

decimal, point, placing of. 19; recurring. 
13, 349-50; infinite or uneruling, 13, 
93. 350; Colson, 663, 669 
decomposition, of casein, 149; of tetano- 
lysin, 148: of vibriolysin, 149; into 
partial fractions, 415 31 
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definite integral, 257-66, 269; 275-300; 
addition rule for, 308; area as, 259-66, 
269, 275. 278-84, 290-300; beta, 567 
-8; by parts, 308; by summation, 
264—6, 276-8; change of variable in, 
307-12; definition of, 258; distance 
as, 257-8; gamma, 566-8; infinite, 
306-7; length of curve as, 288-9; 
limits or termini of, 259-300; multi¬ 
plication by a constant, 308; numerical 
evaluation of, 302-5; of velocity, 
257-8; quantity of electricity as, 259; 
Simpson’s rule for, 303; substitution 
on, 307-12; volume as, 284-8, 290-6; 
work as, 276-7, 300 

degree, of angle, 70; of temperature, 34, 
464; of freedom, (physical) i66, 
(statistical) 574, 624-6, 629-32, 648, 
676, 683-5 

delta, sign for difference, 166; Kro- 
necker, 521 
Denmark, 625-6 
denominator, common, 12, 58 
density, 444-5; of probability, 584, 
591-2,594.612 

dependence, test for functional, 538 
dependent variables, 201 
derivatc, see derivative and partial deriva¬ 
tive 

derivative {see also partial derivative), 
174-5; 191-2; standard forms, 191-2; 
of algebraic functions, 172-3; of func¬ 
tions of functions, 187-9; of hyper¬ 
bolic functions, 187; of inverse func¬ 
tions, 190; of logarithms and anti- 
logarithms, 178-80, 186-7; of powers, 
190-1; of a product, 182-4; of a 
quotient, 185; of a series, 364; of a 
sum, 181-2; of trigonometric func¬ 
tions, 180-1, 184, 187 ; acceleration as, 
315; change of variables in, 187-9; 
complex, 401-2; higher order, 316, 
329-34; second, 315, 319-32; succes¬ 
sive, 316 

derived function, 194; ree derivative nwd 
partial derivative 
derived units, 442-3 
Descartes, 68 

descending factorial, 357-61, 365 
determinants, defined, 497-g; addition of, 
500-1; cofacior in, 509-10, 526; ele¬ 
ments of, 497; expansion by row or 
column, 510; interchange of columns 
or rows, 502-4; Jacobian, 537 - 9 ; 
linear equations and, 496-7, 505-8; 
minor in, 509-10, 526; multiplication 
of. 500, 526-8; numerical evaluation 
of, 507; of matrix, 515, 526-8; prin¬ 
cipal diagonal of, 497 ; second and 
third order, 499; sign of term in, 
498-9; with equal rows and columns, 
504; transposition of, 502, 515 
deviance, 577, 587, 599 


deviation, mean, 572-3; probability of, 
594; standard, 574“5. 578. 585. 587-8, 

591-2, 594. 599. 602, 604-5, 630 
diagonal, of determinant, 497 
dielectric constant, 472, 478 
diet, heat value of, 155—6 
differences, in interpolation, 54, 81-3, 
670, 673-5; mean, 82; negative, 82; 
standard error of, 621-2, 630; small, 
see small change 
differential analyser, 18 
differential coefficient, see derivative 
differential equation, methods of solu¬ 
tion, 247-56, 366-7, 476-8; exact, 
251-2,338-9; Bemouilli, 251; homo¬ 
geneous, 247-9; integrating factor of, 
252; linear, 250, 476-89; partial, 
253-6; Riccati, 251; with separable 
variables, 247; solvable for variable, 
249-50; arbitrary constant in solu¬ 
tion, 245-6; arbitrary function in solu¬ 
tion, 254; check on solution, 245 ; first 
order, 243-53; integral or solution of, 
243; of chemical reaction, 467-8; of 
cooling, 242-4; of population growth, 
242-6, 250; second order, 324, 409, 
476-8; series solution of, 366-7; 
special solutions of, 252; use in prov¬ 
ing addition theorems, 256 
differentiation, 174; see derivative and 
partial derivative 
digestion, 147, I 49 . ^93 
dimensions, method of, 460-3 
dioptre, 479 

diphtheria antitoxin, 467 
directrix, 92, 98-9, 101 
discontinuity, 178, 242 
discontinuous distributions, 569-78, 597 > 
603 

discriminant functions, 649-59; for 
placenta praevia, 650; for species of 
iris, 657-9; for survival of babies, 
653-5; ideal, 651; linear, 656-9; 
quadratic, 653-6 
dissociation of water, 57 
distance, between points, 84; as integral, 
357-8; as area, 262 

distributions, 569-615; binomial, 578— 
80, 595-7; change of variable in, 
590-2; correlation, 603, 608, 609, 
613; covariance, 603, 606, 608, 612-14, 
656-7; cumulative, 570-1, 580—1; 

exponential, 589; function (proba¬ 
bility density), 584, 591-3, 594, 612; 
one-variable Gaussian or normal, 
589-90,592-7.581, 617, 619; grouped, 
582—7, 604—5; invariance matrix or, 
613, 656-8; many-variable, 612-13; 
mean or expected value, 572, 576. 580, 
581, 587-94, 606, 608, 612—14; multi¬ 
nomial, 613-15; of birth weight, 653 
-5; of children in family, 597-600; of 
gestation time, 581—3, 618-19, 653-5; 
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of height, 592-4; of heights of father 
and son, 608, 609; of infant mortality 
and overcrou'ding, 604; one-variable, 
in general, 569-78, 580-92; Poisson, 
597; rectangular, 588-9; standard devi¬ 
ation of, 575. 588-94, 608, 616-17; 
two-variable, in general, 597-607 ; two- 
variable Gaussian, 607-ro, 652-3, 

655-6; variance of, 575-6, 580-1, 
588-92, 606, 608, 613, 630; see also 
sample 

distributive law for complex numbers, 

385-6 

divergence, 350-2 
divisibility, tests for, 10 
division, arithmetical, 15-16; by logar¬ 
ithms, 108-9; by nomogram, 153, 161; 
by slide rule, 150-1; of complex num¬ 
ber, 383; of fractions, 11-12,57-8; of 
matrices, 524-5; of polynomials, 30, 
31, 44-5; of probabilit>-, 549-50: of 
a straight line, 91, 103; Colson, 

667-8 

dog, 44, 672 
dominant, 543, 547 
doub, 663 

double integral. 446-50, 453-5, 539 

double series, 370 

drachm, 441 

dram, 440 

Drcyer, 120, 123 

Duarte, A. J., 349 

Dubois, 120, 157-61,465 

du Nouy, 146 

dyne, 443-4 


C » 3 » 

earth, size of, 142 
eccentric angle, 95 
eccentricity, 98-100 
Edinford, 515-16 
efficient estimation, 640-2 
eigen value, see latent roots 
elasticity, 34, 64 

electric charges and currents, 469-71; 
capacity, condenser, resistance, 475 - 6 ; 
conductivity, 147; oscillation, 475-8; 
units, 469; charge (quantity) as defin¬ 
ite iniegnil, 259 
electrocardiogram, 414 
electrolysis, 469, 472 
electromagnet, 474 

electromagnetic induction, 474-6, 478; 

radiation, 478; units, 469 
electromotive force, 474 “^ 
element, of determinant, 497; of matrix, 
514 

elimination of unknowns, 486-91, 493-6 
ellipse, 94-9: area, 264, 293: a* section 
of cylinder, 96-8; as conic section, 99 
elliptic integral, 434-S 
embryo, 234 


cncrg>', electric, 471; heat and mechan¬ 
ical, 214; kinetic. 454: in diet, 155-6; 
of gas, 208, 212, 214, 219 
England, 7, 34, 625-6 
epsilon symbol, 501-2, 504, 526-8 
equation {see also simultaneous equations, 
differential equation), Bessel, 367: 
cubic, 481-3; of gas, 34. 55-6, 308, 
213-14, 219-33. 277. 465-6; Newton’s 
method of solution, 483-6, 510-13: of 
circle, 93; of hj’perbola, 102: of para¬ 
bola, 91-2; of straight line, 86-7, 89; 
proof of existence of root, 425-7; poly¬ 
nomial, 425-7, 484-6; quadratic, 40; 
independent of units, 460-3 
equiangular spiral, 106-8, 393-4 
erg. 442-4 

error, experimental, 4-7; small, 199- 
202; proportional or percentage, 7. 
127, 200; correction lor, 200; in 
approximation, 216-18 
estimation, 616-21, 623, 637-45; com¬ 
bined, 623; efficient, 640-2: general, 
637-40; maximum likelihood, 632-8, 
640-2,644-5,648; minimum x ‘.642: 
of parameter. 617, 621, 635, 637-42; 
of regression, 647-9; of several para¬ 
meters, 642-5: of variance, 618-19. 
630: standard error of, 617-20, 621, 
639-45; 'I'weedic’s efficient, 641-3 
Euclid, 14 
Euler, L., 566-8 

evaluation, numerical, of area, 302-5; of 
detemiinant, 507; of inverse m.itnx, 
489, 526; of latent roots and vectors, 
534-7; of solution of linear equations, 
487-93; of general equations. 483-4), 
510-13; of polynomial, 44-5 
events, pnihabiliiy of, 540; conditional. 
549-50; independent, 544-6; succes¬ 
sive, 547-9 

Everitt's formula, 54-5 

exact dilfercntial equ.ition, 251-2, 338-9 

Exactus, 1 5 

existence of root, 425-7 

expansion, by row or column, 510; com¬ 
plete, 497-9 

expected value, for x'. 623-8; Ste olio 
mean (true) 

exponential decay, 3o(»: ilistrihution, 
589; growth, 243-6; series. 353 ‘ 5 . 

403. 568 

exponential function, 131-2, 393 5 , 

397-400. 402 

extrapolation, danger of, 42-3 
eyes, of cave fish. 4 

factorial, ascending and descending, 357 ’ 
61. 365; as integral. 564-^*; general¬ 
ized. 566 8; of integer, 330; reduced. 
358-/)!, 365; Stirling’s approximation 
to, 565-6 
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factorization, arithmetic, lo; algebraic, 
40-1, 44, 58-61; tests for, 10, 59; 
uniqueness of, 10, 24-6 
Fahrenheit scale, 34, 464 
Falconer, 625 

fallacy, 4, 5, 68-9, 193-4. 3” 

Famulener, 149 
farad, 472 
Faraday, 472 
fat, IS5 

Feldman, W. M., 145, 295, 322 

fiducial probability, 542 

figure, significant, 6^, 13; unknown, 7 

finger-print, 622 

Finney, D. J., 480, 627 

fish, cave, 4 

Fisher, R. A., 304, 528, 557, 564, 575, 
610, 620, 623, 632, 634, 636, 648, 657, 
683, 684 

fitting, of polynomial, 50-5, 648, 649 
flagpole, height of, 73 
Fletcher, A., 242 
flow of liquid, 456-9 
fluid drachm, ounce, 440 
flying-fish, 118 
focal length, 148, 479 
focus, of parabola, 91; of ellipse, 98-9; 
of h>-perbola, 101; relation with 
directrix, 98-100 

forces, unit of, 161,443-4; electromotive, 
474-6; magnetic, 473-45 psycho¬ 
logical, 57; triangle of, 374 

Fourier series, 412-14 
fourth-degree polynomial, 42, 44 
Fox, L., 491 

fraction, arithmetical, 11-12; algebraic, 
55-8, 415-31; equations containing, 
58: logarithm of, 113; partial, 415-31 
frequency of radiation, 479 
frustum, area of, 284 
function, definition of, 45; arbitrary, 254; 
Bessel, 367; beta, 567-8; branches of, 
138; complex, 390-405; defined by 
integral, 241; derived, 194, 401-2; 
distribution, see probability density; 
elliptic, 435; factorial or gamma, 330, 
566-8; generating, 559-^0; hj^er- 
bolic, 132-8; implicit, 93; inverse, 
137-9; many-valued, 138; many- 
variable, 47; notation for, 46-7; of 
function, 187-9; single-valued, 138; 
trigonometric, 71-83, I3^“7. 39S“403; 
two-variable, 47, 49 
functional relationship, 538-9 
fundamental note, 411 


galvanometer, 201 

gamma function, integral, 566-8 

gas constant, 466 

gas, energ^i' of, 208, 212, 214, 219; equa¬ 
tion and properties of (pressure, tem¬ 
perature, volume), 34, S5“6. 208, 212- 


14, 219-23, 277, 465-6; nomogram 
for, 161; specific heat of, 213-14, 219 
Gates, 624 

Gaussian distribution, 581, 589-90; 59*- 
7, 617, 619; for several variables, 
607-10, 652-3, 655 
Gaussian units, 469 
G.C.F., 27 

gene, 546-7; for baldness, 554-5 5 fo** 
blood-groups A-B-O, 547, 554, M-N, 
546, 552, Lewis, 643, Lutheran, 632; 
in peas, 549; in mice, 549; frequency 
of, 552-4 

generating function, 559-60 
genetics, 2, 4, 546-62; of dogs, 44; of 
blood-groups, 546-7 632, 643-4; of 
height, 32-6; of maize, 636; of mice, 
622, 624-5; of peas, 2, 549-50, 578 
genotype, probability of, 546-5? 
geometric rate of growth, 344, 348-9; 

series (progression), 119, 344-52 
geometry, i; analytic (algebraic), 68-83 
gestation time, 581-3, 618-19, 653-5 
Giorgi, G., 469; system of units, 441, 
442, 460, 464, 466 
Glaser, E. G., 465 
glomerulus, 459 
grade (unit of angle), 70 
gradient (slope) of line, 33-4. 165-6, 176 
graduation, see scale 
grain, 441 

grand total, 368-9, 629 
graph, 32-7, 42-4; of h>-perbolic func¬ 
tions, 133; of rational functions, 56; 
of trigonometric functions, 76; straight 
line, 143-7, . . 

graph paper, loganthmic, 142 
graphical aids, 140-62; see nomogram, 
slide rule, scale 
gravitational units, 444 
greatest common factor, 27 
Greek letters, 270, 575, 603, 673, 677 
Gregorian calendar, 436 
grouped distribution, 582-7, 604-5 
grouping, Sheppard’s correction for, 587, 
605 

groups, blood, see blood-groups 
growth, discontinuous, 242; exponential, 
243-6; geometric, 344, 348-9; line of, 
95; logistic, 244,246; of bacteria, 119, 
134, 224-5, 242, 246; of chicken em¬ 
bryo, 234; of child, 187, 320; of popu¬ 
lation, 242-6, 250; of schoolgirls, 41, 
42, 50, 119; pre-natal, 36 
Griineberg, H., 622, 624 
Guest, P. G., 648 
Guldberg, 466 

haemoglobin, 468-9 
haemorrhage, 649-50 
Haldane, J. B. S.. 528 
Hardy, G. H., 14, 552 
Hardy-Weinberg L^w, 552 
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hare, tortoise, snail, 333-4, 361 

harmonic (overtone), 411; motion, 409 

Harris. H., 554, 625, 631 

Hartman, 146, 625 

Haughton, S., 95 

H.C.F., 27 

healing of wound, 146 
heart, 145-6 

heat. 459-60, 464-6; units of, 214, 459; 
conversion factors, 460; electric, 109, 
47j; energy, 214; latent, 459: loss of, 
158, 160-1, 465: radiant, 478; pro¬ 
duced by infant, 123; value of diet, 
155-6 

hccto- (prefix), 438 

height, and pulse rate, 144-5; inheritance 
of, 32-6; of adults, 118, 142, 592-4; 
of distant object, 128; of fathers and 
sons, 32-6, 608-9: of schoolgirls, 41, 
42, 50, 119; sitting, 123; of flagpole. 
73 : of hill, 74 
henry. 475 
heredity, see K^'netics 
hetorogonic relation, lao-j 
hijthcr order derivative, 316, 329-34 
hiuh<>*^t common factor, 27 
Hill, A. V., ^(n) 
hill, height of, 74 
histogram, 570-1, 582-4 
historical note on calculus, 198 
Hollerith, 22, 24 
i lolt, S. li., 622 

homogeneity, of proportions, test for, 
620-1, 623-8; of means, 628-32 
homo^^eneous equations (differential) 
247-9, (simultaneous) 508 
Hooke's law, 34 
horse-power, 444 
]'iotcllin^% H., 528 
Hunter, j., 321 

hyperbola, definition and properties, 
100-4 1 rcclan^tular, 55 ^ 7 * '^4 J 
under, 263 

hyperbolic an«le, I 35 

hyperbolic functions, 132-9, 3 ^ 8 ; rela¬ 
tions involving, 136-^; derivative of, 

*87 

hyperbolic loj^arithrn, see loj^arithrn, 

natural • t 

hypothesis, probability of, 541; tesiinf; of, 
I (see also si^'nificance test) 

ideal discriminant, 651 

idealization of concepts, 8, 171—2, 230, 
242, 542-3. 5 ^ 4 . 598 
identity (equation), 30 
JmaKc (from lens), 479 
imaginary number, 382 
immersion in cold water, 465 
immiuratfon, 242, 244-5 
implicit function, 93 
impossibility, 542 

inbreeding, 528, 555^ 

2 / 


inconsistency of equations, 493-6, 506 
indefinite integral, definition of, 228; 
standard forms, 230-3; arbitrar>' con¬ 
stant in, 226-9; by parts, 231, 238-41 ; 
change of variable in, 234-8 ; complex. 
404-5; defining new function, 241; 
multiplication by constant, 228; nota¬ 
tion for, 229; of product, 237-41; of 
series, 364; of sum, 229; substitution 
in, 234-8 

independence, functional, 538; statistical, 
544-6, 625-8; of units, 462; of vari¬ 
ables, 201, 613 

index, refractisc, 478-9; laws (in alge¬ 
bra), 31 

inductance, coefficient of, 475 
induction, electromagnetic, 474-6, 478 
inequality, 62-7 

inertia, moment of, 454-5, 576 ; electrical, 
475 . 

intanc, mortality, 604; heat production, 

123 

infection of wound, 146 

infinite integral, 305-7 

infinity, 56, 177; sum to. 349 - 55 . i 70 

inflexion, point of, 319, 593 

information, 634-7 ; matrix, 643-5 

inoculation, 626 

insect, path of, ic6 

instantaneous velocity, 169-73 (see ol^o 
derivative) 

integral (see <tho indefinite integral, 
definite integral), standard forms, 230- 
3; beta, 567-8; cucflicicnts in equa¬ 
tions, 489-91; elliptic, 434-5; fac¬ 
torial, or gamma. 566-8; infinite, 
30O-7; irrational (uttli square root), 
433“5I nomial ((iaussian), 405, 594, 
675, 683; rational function {sec partial 
fractions), 415-31; repeafed, 44K-51, 
453-5, 539; multiple (double or sur¬ 
face, triple or volume), 446-50, 453-s, 
539; trigonometric, 40()-X, 411 13, 

432-3 

integrating factor, 252 
intensity of illumination, 479 
interchange of row or column. 502 4 
intermediate maxima and minima, 31 7-23 
international unit, 479-Ko 
interpolation, 49 - 55 . - 3 . 670, 673- 5 

invariance matrix, 613, 656, 658 
inverse function, 137^9; derivative of, 
190 

inverse matrix, 525-6, 528, 538; calcu¬ 
lation of. 489, 526 
inverse pr<ihal>jliiy, 542 
inverse square law, 306, 470 
inverse tangent series, 364 
inversion of e(juations. 4891 525-6; of 
cane sugar. 466-7 
ion, 57 

Iris, 620, 657-<> 
irrational integrals, 433-5 
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Isaac, 9 

isocritic line, 654 
Jacob, 9 

Jacobian matrix and determinant, 537'~9 
jaws, 44 
Jeffreys, H., 642 
Joachimsthal’s formulas, 91, 103 
joule, 214, 443, 449; equivalent of heat, 
460 

jump, in function, 178 

Kalmus, H., 625, 631, 649-50 
Karn, M. N., 581, 586, 621, 653-5 
Keetch, D. V., 368, 642 
Kelvin, 464 

Kendall, M. G., 597, 682 
kidneys, 458-9 
kinetic energ>', 454 
Kjeldahl, 149 


Lack, D., 345 
ladder problem, 189—90 
lapwing, 345-7. 349 
latent heat, 459 

latent roots and vectors, 528-37; calcu¬ 
lation of, 534-7 
Latvians’ blood-groups, 642 
law of cooling, 410; of ine<iualities, 63-4; 
of mass action, 466-9; of nature, 30; 
Ohm’s, 471 

Lawler, S. D., 368, 642 
laws of algebra, 30-2, 37-8, 373 “^*. 379 “ 
86, 515-22, 526-8 

L.C.^1., least common multiple, 27-8 
Lee. A., 32-4 
Leibnitz, 198, 229 


length, summary of formulas, 293; units 
and conversion factors, 438-9; of arc, 
J24-6, 293; of catenary, 289; of 
cur%e, as integral, 288-9, 293; of 
tlying-fish, 118; of head of locust, 344; 
of parabola, 289 
lens, 148, 162, 479 
lethal gene, 549 
Levens, A. S., 153 
Lewis blood-group, () 43-5 
light, properties of, 478-9: velocity of, 

472. 47 « . ^ o 

likelihood estimate, maximum, 632-8, 
640-2. 644-5, ^4^5 standard ern^r of, 
bU. 640 

limit, 127, 170, 172. 176-8; complex, 
401; formal definition, ig6-8; of 
error, 333-4; of sum, 276-7; Poisson, 
597; (terminus) of integration, 259, 
300,306-7 

line, of growth, 95 ; isocritic, 654; repre¬ 
senting vector. 37 «-2; density, 445 
linear discriminant, O56-9; 1. differen¬ 
tial equation, 250, 476-8; 1. simul¬ 
taneous equations, 486-96; 1. func¬ 

tion. 34; 1. interpolation, 52-3, 81-3 


linkage, 636-7, 640, 642 
liquid, 456—9 
litre, 440 

living things, size of, 141 

local maxima and minima, 43, 317-23 

locust, 344 

logarithmic comparisons, 118; log. 
graph-paper, 142; log. relation, 119- 
23, 142-6, 148-9; log. scale, 140-6, 
149, 151, 153, 157-61, 631 
logarithms, 106-32; tables of, 673-5, 
680-1; addition and subtraction, 
115-17; addition law, 107-8; approxi¬ 
mation to, 130; base of, 112, 124; 
characteristic of, 112-13; Colson, 671; 
common or Briggsian, iio-ii; com¬ 
plex, 390-3, 402; defined by area, 263; 
derivative of, 178-80, 186-7; general 
definition, 106; mantissa of, 112; 
many-valued, 397-8; modulus of, 129; 
natural, napierian or hyperbolic, 130; 
of fractions, 113; of negative numbers, 
114-iS; series for, 354-3. 403 5 sys¬ 
tems of, 110-12, 124, 129-32; use in 
finding powers and roots, 109-10, 123; 
use in multiplication and division, 
108-9 
logic, I 

logistic function, 134, 480 

London schoolgirls (L.C.C. report), 41-2, 

50, 119 

loop, area of, 278, 296-300, 304-5 

loss of heat, 15S, 160-1, 465 

lumen. 479 

lung, 295 

Lutheran, 632-4 

lux, 479 

Maclaurtn scries, see Taylor series 
Madsen, Th., 148-9 
magnetic properties, 473-4, 478 
magnitude, absolute (real), 64; (complex), 
383-4; (vector), 377; comparison of, 
62-7; directed, see vector, complex 
number 

maize, 636-7, 640, 642 
major axis, 95, 102 
mantissa, 112 

manual computation, 18-22 
many-valued function, 138; logarithm 
and powers, 397-8 
many-vTiriable distribution, 612-13 
mass, units and conversion factors, 440-1; 

action, 466-9; centre, 576 
mathematics, pure and applied, i; signi¬ 
ficance of, 672 

mating, random, 551-4; cousin, 555-6 
matrix {plural, matrices), definition of, 
368, 514; addition of, 515-16; anti¬ 
symmetric, 517; calculation of inverse, 
526; calculation of product, 519; cal" 
culation of latent roots, 534-7; charac¬ 
teristic equation of, 532, 534; complex 
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number as, 523-4; correlation. 613; 
covariance and variance, 613, 656-7; 
determinant of, 515-16, 525-8; division 
of, 524-5; in theor>’ of inbreeding, 
528. 556-7; invariance, 613, 656, 658; 
inverse or reciprocal, 525-6, 528, 538; 
Jacobian, 537-9: latent roots and vec¬ 
tors, 528-37; multiplication of, 516- 
22, 526-8, 538; non-singular, 525-6; 
simultaneous equations expressed by, 
524-6; singular, 525, 528: symmetric, 
517; subtraction, 516; transposed, 
5>5-i7, 5*9, 521; unit. 521, 528; vec¬ 
tor as, 522-3; zero, 515 
maxima and minima, 36-9, 43, 316-22, 
332-3.339-43; absolute, 43, 317; ter¬ 
minal, 317-18; for several variables, 
339-43 ; of rate of growth of child, 320 
maximum likelihood estimate, 632-9, 
640-42, 644, 645, 648; standard error 
of. 634. 640 

mean, of sample, 57** 576-8, 585-7, 
591-2, 599, 604, 605, 611-12. 618, 62 I; 
true or distribution, 572, 576. 580-81, 
587-99, 606, 608, 613-14; deviation. 
572-3; difference, 82, 670. 673-4; 
grand, 629; of sum, difference, and 
weighted combination, 610-11, 615; 
standard error of, 618-19: square. 572. 
629-31 ; test for difference of, 628-32; 
vector, 612-13; weighted, 452 
mechanical calculation, 14-18 
mechanical equivalent of heat, 214 
Mceh, 123 
mega- (prefix), 438 
Mendel. G., 2, 547 . 5 , 49 . 612, 636 
metabolism in bacterium, 211, 294 
metaphorical " rate ” of change, 167 
rnctrc-kilogram-second (MKS) system, 
441-2, 460. 464, 466 
metric system, 438-44 
mice, 549, 622, 624-5 
micro- (prefix), 438 
mil, 439 

Miller. J. C. P.. 242 
Milne-'Phomson, I-. M-, 116, 435 
minim, 440 

minimum, see maxima; 

minor, in determinant, 509-10, 5^^** 

axis, 95, 102 

minute of an^^le, 70; of ume, 430 
M-N blood-^<roups, 546-53. 5^11,613-14, 
642; «cnc frc<|uency, 552 
modulus, of real number, 64 ; of complex 
number, 383^; 'd elasiieily, 34 . ^>4 i 
<if logarithm, 129; of scale, 159 
Mohr, J., 62S-^> 
mole, 213, 4 ^ 4 . 4^6 
molecular wcitfhl, 213. 4 ^* 4 . 466 
mollusc, 95 

moment of inertia, 454-5. 576 
momentum, 454 
month numbeni, 437 


moon. 129 
Moronami, 227 

mortality, of lapvvinp, 345*^. 349; in¬ 
fant, 604 

motion, of charge, 473-6; cur\ ed. 194-6 
multinomial distribution, 613-151 ni. 

probability, 558—60; m. scries, 367-8 
multiple integral, see integral (double, 
triple, repeated) 

multiplication, arithmetical, 18, 23-4; by 
logarithms, 108-9; by nomogram, 
152-3; by punched cards, 23; by slide 
rule, 150-1; Colson, 665-6; of complex 
number, 379-81; of determinant, 
526-8, 550; of fractions, 11-12, 57-8: 
of Jacobians, 538; of matrices, 516-22, 
526-8, 538; (numerical) 519; of prob¬ 
ability', 544-8; of vector, 373-4; of 
polynomial. 30-1; of scries, 363 
Murray, H. A., 234 
mutation, 5 

nano- (prefix), 43K 
Napier, 130 

napierian, rer logarithm, natural 
natural selection, 5: n. law, 30; n. log.ir- 
ithm, see logarithm 
neg, 663 

negation, probability' of, 543 
negative area, 296-300; digits 660-3; 
number, 12, 114-15, 622-3; power, 
124 

nerve, 322 
Neville, E. H., 435 

Newton, I., 198, 356; law of cooling. 
244, 410; solution of equations, 483-6, 

neuton (unit of force), 16 c. 443, 444 
Ncyman, J., 642 
Ngboglus, 461 
nitrogen determination, 149 
noiiKigram, 140, 150-62; for ad^lifion, 
152-7, 162; for heat \alue of diet, 
155-6; tor colour index. 161; for 
Dubois formula, 157-61 ; for heat lo^^, 
158, 160-1 ; for multiplication, 152-^; 
fur IN thagoras's iheoreni, 162; for 
respiratory quotient, 161; for surface 
area, t57-fH ; fundamental theorem 
for, 153-5; modulus of scale of, 159; 
scales and graduations, 152-62; with 
three parallel scales, 153-60; vvillimore 
than tlirce scales, 160-2 
non-singular matrix, 525-6 
normal distribution, 581, 589-90, 7, 

617, 619, (many-\ariahle) 607-1 o, 

652-3, 655; n. integral, 405, 594* 6?^. 
683; n. temper.iture ami pressure, 
455 ; n. vision, 1 28 
notation. C*<ilst>n, 660-71 
N/r.J'.. 455 

nutnerui, Arahie, 660-2; C<dson, 6f)C 2; 
Homan, 660 i, 664 
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numerical evaluation of determinant, 507; 
of matrix product, 519; of inverse of 
matrix, 489, 526; of latent roots and 
vectors, 534-7; of solution of linear 
equations, 487-93; of solution of 
general equations, 483-6, 510-13; of 
polynomial, 44, 45; of trigonometric 
functions, 81-3; of definite integral or 
area, 302-5; of logarithm, 112 

oblate spheroid, 95 
odds, 543 

Ohm’s law, 199, 471, 476 
opposites, II 

order, of approximation, or small quan¬ 
tity, 328-9; of differentiation, 337-8; 
of reaction, 148-9; second, of differ¬ 
ential equation, 324, 409, 476-8; sec¬ 
ond, of derivative, 315, 319-32 
ordinary differential equation, see differ¬ 
ential equation 
ordinate, 69 

origin of co-ordinates, 35, 69 
oscillation, electrical, 475-8 
osmotic pressure, 45-6 
overcrowding, 604 
overtone, 41 r 

panmixia, 551-4 

parabola, 36; properties of, 91-2; area 
under, 263, 274-5 

paradox. 4. 5, 68-9, 167, 193-4. 209, 543 
parallel lines, 87 
parallelogram, area of, 284, 293 
parallelopipedon, volume of, 287-8, 294 
parameter, estimate of, 617, 621, 635, 

637-45 

parametric form, 96 

partial derivative, derivate, or differential 
coefficient, 202-25; change of \-ari- 
ables in, 209-10, 538; formal defin¬ 
ition, 216-18; higher order, 334-8 ; in 
constrained variation, 219-25; in a 
plane. 207; order of differentiation on, 
337-8 

partial fractions, with simple factors, 
415-20; with repeated factors, 421-4; 
with real factors, 428-31; with quad¬ 
ratic factors, 429-31 

parts, integration by, 231, 238-41, 308; 

real and imaginarj*, 386-8 
Pascal’s triangle, 358-61 
Pearl, 246 

Pearson, Karl, 32-4. 592, 623 
peas, genetics of, 2, 549, 550 
pendulum. 410 
penicillin, 480 

Penrose, I-,. S., 581, 586, 621,653-55, 658 
pepsin, 147. 193 
percentage error, 7, 127, 200 
permeability, magnetic, 474-8 
permittivity, 470-2 
perpendicular lines, 87 


petal length, 620, 657-9 
phase, 409 
phenol, 142—5 

phenylthiocarbamide (P.T.C.), 625, 626, 
628, 631 
TT (pi), 29, 307 
Piaggio, H. T. H., 243 
pico- (prefix), 438 

pivotal term, equation, 487-91, 506—7 
place, see decimal 
placenta praevia, 650 

plane, differentiation in, 207; vector, 

378-9 

planet, 95 

point, co-ordinates of, 69-70; of in¬ 
flexion, 319, 593 ; representing vector, 

371-2 

poise, 457 

Poiseuille, 455, 458-9 
Poisson limit distribution, 597 
polar co-ordinates, 70-2, 207-11 
polygon, area of, 301—2 
polynomial, 30, 32-55; cubic, 41-4: 
derivative of, 172-3; equations, 484-6; 
fitting of, 50-5, 648-9; factors of, 
58-61, 427-9; fourth degree, 42-4; 
numerical evaluation of, 44, 45; oper¬ 
ations with, 30-1; prime, 61; quad¬ 
ratic, 36-40; terms of, 30; approxi¬ 
mation to, 37 

population of England and Wales, 7, 34; 
of United Kingdom, 7; with immi¬ 
gration, 242, 244-5 
potential, 62, 471, 475“6 
poundal, 443 

pow’er (algebraic), complex, 381, 396-8; 
by logarithms, 109-10, 123; derivative 
of, 172-3, 190-1; many-valued, 397-8; 
negative, 124; sum of, 274; (rale of 
work), 443-4: (of lens), 479 
Powers-Samas, 22, 24 
practical calculation, see numerical 
pre-natal growth, 36 

pressure, 62, 445; unit of, 16, 455; of 
gas, 55-6, 161, 208, 212-14, 219-23; 
of hydrogen, 464; of liquid, 457-9; 
osmotic, 45-6 

prime number 10, 24-6; polynomial, 61 
prism, area of, 281-2, 293; volume of, 
285-6, 294 

probability, 540-63; definition of, 540; 
addition of, 543-4; approximate, 563; 
binomial, 557-63. 578-9; conditional, 
549-50; density of, 584. 591-2. 594, 
612; division of, 549-50; fiducial, 542; 
genetical, 543, 546-62; idealized defin¬ 
ition. 543; inverse, 542; multinomial, 
558-60; multiplication of, 544-8; of 
deviation, 594; of hypothesis, 541; of 
independent events, 544-6; of nega¬ 
tion, 543; of successive events, 547-9; 
of sur\’ival> 581, 654; relation to odds, 
543; represented by area, 583-4, 594“5 
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product, derivative of, i8z-4; integral of, 
237-41; sum of, 599-600; moment 
correlation, see correlation 
proijression, see series 
projectile, 36 
projection on line, 79 
prolate spheroid, 95 
pronunciation of sinh, tanh. 134 
proper value, see latent roots 
proportion, standard error of, 617, 619- 
20; test for difference in, 62 r, 623-8 
proportional error, 7, 127, 200 
protein, 155 
psychological forces, 57 
Pullig, 624 
pulse rate, 144-6 
pure mathematics, 1 

pyramid, volume of, 286-7, 294: tri¬ 
angular, 377 

Pythagoras, 12, 75, 84-5, 162, I95“6 
(|uad, 663 

(|uadratic, discriminant, 653-6; c<|ua- 
tion, 40; interpolation, 53; poly¬ 
nomial, 36-40 
c]uartic polynomial, 42-4 
quin, 663, 669-70 

quotient, derivative of, 185; rcspiraiorN, 
161 

rabbit, 43, i4S'6 

Race, R. R.. 368, 632, 642 

radian, 125, 180-1, 192 

radiation, 478 

radioactivity, 306-7, 589 

radius, of convergence, 4<^3 I gyration, 

576 

random. 542 ; mating, 55 > “4 
range, proportional, 118; of \ariaiion, 
118, 572, 587, 590; ot values ot cor¬ 
relation, 602-3, 6x2 

rate of change, instantaneous, see deri\- 
ative; average, 168-9, 172 3, 182, 18K; 
relations involving, 226 56 
rate of growth, 168-70, 320 
ratio, probabilitv, 6s 1-2; sex, 54tt 5^6; 
variance, 629-32. 676, 6K4 -5: trigon¬ 
ometric, see trigonometric function 
rational number, see fraction; tunction, 
see algebraic fractions 
reaction, chemical, 57, 148-9, 322, 464. 
469 

real number, 14; considered as complex, 
379. 381-2; real part, 3H6 H 
recessive, 543, 547; from cousin mar¬ 
riage, 555-^> , . o * 

reciprocal, of complex nufnt)er. 3^3. 

matrix, 525528; scale, 146 
recoml)inution fraction, 636 
rectangular distribution, 
rectangular hyperbola. SS"? 
recurrence relation, see reduction formula 
recurring decimal, 13 
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reduction formula, 433 
redundant equations, 493-6 
refraction of light, 148, 478-9 
regression (graph of means), 33, 35^, 
643-9 

relation, allometric or heterogenic, 1 20-3 ; 
between trigonometric functions, 75, 
77, 79, 80, 83, 136-7. 399-401 ; be¬ 
tween hsT^erbolic functions, 136, 137; 
focus-directrix, 98—100; functional, 
538-9; logarithmic, 119-23, 142-6, 

148, 140; parametric, 96 
relaxation method, 491-3 
remainder theorem, 59 
repeated integral, 448-51, 453-5 
resistance, electric, 199, 471, 475“6 
respiratory quotient, 161 
Riccati equation, 251 
ridge count, 622 
Robertson, 320 
Robinson, (j., 414 
rocket, 306 
Rogers, C. A., 274 
Roman numerals, 6<io-i, (^64 
root, by logarithms, 110, 123; complex, 
382, 395-6; of algebraic ecjuation'^, 
existence of. 425-7; of minus one, 
382, 523; of polynomials, 60; Nevs- 
ton*s method of finding, 510-13 

Kosenhead, L., 242 

row, expansion by, 510; e<|ual, 504; 
interchange, 502-4; total. 369, 597, 
600 

rule of signs. 374, 381-2 
Russian letter^ , 202 

.sad<ltcback, 342 

sample, random, 542, 569; agreement 
wilh expectation. 621 -32 ; calculaiion of 
covariance, 600, 604; calculation ot 
mean and variance, 57(1 8, 5X5 7, 509. 
604; coties lance, 600; correl.ition, 
603-'5, 609, 610, 612-13, 6i<^ 20, 

622 3; covariance, 590 <*05, 608, 

610-11. 614, 619, (fiiainx) t»56 - 7; 

deviaiue, or eorreeietl sum ot squares, 
577. 5^7. 599i from luniti’d popul.itMjn, 
569; grouped, 582-7, 604 5; histo¬ 
gram. 570-I, 582-4 ; mean, 57 J. 576 8. 
585-7, 591599. 604 5, 617, 61X, 
621; range, 572, 5^7 90; scatter dia¬ 
gram, 601; stereogram, 605-7; Man- 
dard deviation, 574 5* 578» 585. 587 8, 
591-2, 594, 599. 602, 604 5, 619. 630; 
variance, 573 -I. >76 8, 585-7. 591 2. 
591;, 604 5, 6j 8 -C9» 630 
Samuelson, P. A., 483. 534 
Sanger, H , 36K, 642 
scal.ir, 371 

scale, logarithmic. 140 6. 149. 153. 

157-61. 631; square root, 146 7; 

reciprocal, 146; of graduation. 165, 
176; of noinogran>, i53-^>2 
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Scammon, 36 

schoolgirls. 41-2, 50. 1x9 

Schiitz-BorisofF law, 147, 193 

scruple, 

sea urchin, 95 

sec, secant, sech, see trigonometric func¬ 
tions and hyperbolic functions 
second, of angle, 70 
second moment, 454 

second order, derivative, 315, 319-32; 
differential equation, 324, 409, 476-8; 
determinant, 499 

section, of cone, 99—101, 104—5; 

cylinder, 96—8 
sector, area of, 280-1, 293 
selection, natural, 5; of sample, 549 
self-fertilization, 636 
separable variables, 247 
separation of groups, 649-59 
series (see also Taylor series), 266-7, 
271-6, 344-68, 403; addition of, 363, 
403; antilogarithm, 353, 355; arith¬ 
metic and generalized arithmetic, 53, 
266-7, 271-6; arithmetico-geomctric, 
347; binomial, 355-60, 403, 560, 568; 
complex, 364, 403; convergence of, 
350-2, 370, 403, 576; differentiation 
and integration of, 364; divergence of, 
350-2; double, 370; exponential, 
353“5» 403. 568; in solution of differ¬ 
ential equations, 366-7; Fourier 
(trigonometric), 412-14; geometric, 

119, 344-52; geometric convergence 
of, 351, 352, 370. 403; inverse tangent, 
364: logarithmic, 354-5. 403; Mac- 
laurin, see Taylor series; manipu¬ 
lations with, 362-4, 368: multinomial, 
367-8; multiplication of, 363; power, 
see Taylor series; radius of conver¬ 
gence, 403; sine and cosine, 355, 403; 
sinh and cosh, 355; sum to infinity, 
349-55; tests for convergence, 352 
sex-difference, 631 
sex-ratio, 541, 596 
shape of cells in honeycomb, 323 
sight, normal, 128 
sigma notation, 269-71, 366 
sign of area, 296-300; of term in deter¬ 
minant, 498-9 

significance tests, 609, 620-32, 675-6, 
683-5; for differences in proportion 
(including x* test), 620, 622-8, 676, 
683; for difference in means (rough 
method), 621 ; (analysis of variance and 
“Student’s” /), 628-32, 676, 684-5; 
for difference in correlations (Fisher’s 
c-transfoimation), 622, 686 
significant figures, 6, 7, 13 
similar figures, 69 
simple harmonic motion, 409 
simultaneous equations, general, 486, 
510-13; linear, 486-96; successive 
approximations to solution, 49t-3. 


510-13; inversion of, 489, 525-6; 
determinant of, 496-7, 505-8, 525-6; 
homogeneotis, 508; with integral co¬ 
efficients, 489-91; in matrix form, 
524-5; inconsistency, redundancy, 
number of solutions, 493-6, 506 
sine, series for, 355, 403; table of, 680-1, 
674-5; trigonometric functions, 

hyperbolic functions 
single-valued function, 138, 397—8 
singular matrix, 525-8 
sinh, series for, 355; see hyperbolic func¬ 
tions 

sizes of natural objects, 141-2 
Sjoquist, 147 
Skellam, J. G., 367 
slice, volume of, 284-5 
slide rule, 150-1 
slope of line, 33-4, 165-6, 176 
small changes and errors, 199-225, 323- 
32, 335-7. 538; effect of one factor, 
*99. 323-32; effect of several factors, 
204-5, 335-7. 538; second approxi¬ 
mation, 326-7; general approximation, 
329-32, 335-7; orders of, 328-9; in 
matrix form, 538 
Smith, C. A. B., 627 
Snellen’s type, 128 
Soames, H., 227 
solidus, use of, 39 

solution of equations, uniqueness of, 
493-6, 506, 526; see also equation, 
differential equation, simultaneous 
equations, numerical evaluation 
Sommerville, D. M. Y., 105 
Soper, H, E., 357 
sound, velocity of, 192 
space vector, 371 

special solutions of differential equation, 
252 

specific heat, 464-5 

speed, 168, 195; yee velocity 

sphere, area of, 283-4, 293; volume of, 

287-94 

spheroid, area of, 291-3; volume of, 
291-2, 294 

spiral, equiangular, 106-8, 393-4 
square, completing the, 37-8, 39*-3. 434. 
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square root, evaluation of, 50-1, no, 123, 
484; positive, 40; complex 382, 
395-6; scale, 146-7; integration of 
functions containing, 433-5; of minus 
one, 382, 523-4 

squares, sum of, 577-8, 587, 629; law of 
inverse, 470 

standard derivatives, 191; integrals, 230— 

3; international, 480 
standard deviation, defined, 574; calcu¬ 
lation of, 576-8, 585-7; of binomial 
{= square root of variance), 579; of 
sample, 574-5. 578. 585, 587-8,. 59*-2, 
594, 599, 602, 604-S, 630; true or 
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distribution, 575, 588-94, 596, 608, 
616-17; standard error of, 619 
standard error, table of, 619; definition 
of, 617; of correlation, 619-20; of 
covariance, variance, standard de\i- 
ation, 619; of mean, 618-19; of pro¬ 
portion, 617, 619-20; of regression 
coefficient, 648; of difference, 621-2, 
630; of general estimate, 639-42; of 
maximum-likelihood estimate, 634, 
640; of estimate of several parameters, 
645 

sterc, 441 

stochastic, 542 {see probability) 
straight line(s), equation of, 86, 89; 
through given points, 86, 87; inter¬ 
section of, 87; angle between, 87, 89; 
division of, 90, 91; distance from a 
point, 87 

“ Student’s ” /, 630, 648, (table) 684-5, 
676 

substitution, in integral, 234-8, 307-12; 
in equations, 486-91 

subtraction by logarithms, ii 5 “* 7 ; 
by nomogram, 153; by slide rule, 150; 
of complex number, 384-6; of frac¬ 
tions, II, 12, logarithms, 

107-9; of matrices, 517; of poly¬ 
nomials, 30-1 ; of series, 363; of vec¬ 
tors, 376; abolished in Colson nota¬ 
tion, 665 

successive approximation, to small change, 
326-7, 329-32: in solution of cciua- 
tions, 483-6, 491-3, 510-13 
sugar, inversion of, 466-7 
sulphur in bacterium, 296 
sum (see also total); definite integral as 
limit of, 264-6, 276-8; of arithmetic 
series, 269; of arithmetico-geoinelric 
series, 347; of cubes, 274-5; 
generalized arithmetic scries, 272—3; of 
geometric series, 346; of independent 
variables, 613; of odd numbers, 268; 
of powers, 274: of products of obser¬ 
vations, 599-600; of squares of obser¬ 
vations, 577-8, 587. 629; Rogers’s 

convention for, 274; sigma notation 
for, 269-71, 366; to infinity, 344-55 
sun, 95 , 

surface, nomogram for area of, i57-t»i, 
area and weight, 146; density, 445* 
integral, 446-50; 453 - 5 , 5391 <'1 

revolution, 290, 293-4 
survival, chance of, 581, 654 
Rusceptibility to typhoid, 626 
swimming in cold water, 465 
symmetric matrix, 517 
symmetry of ellipse, 95 
system of logarithms, i lo-i i , 124, 129-32 

I, 630, 648, 676, (table) 684-5 
table of c<^mmon logarithms, 678-9, 
(cxpl.) C73-5 : of X*. 6 « 3 . (expl.), 676; 


correlation, 683, 686, (expl.) 675, 
676, 677: Greek alphabet, 677, (expl.) 
673 ; normal integral, 683, (expl.) 675 ; 
sines 680-1, (expl.) 674-5; variance 
ratio and t, 684-5, (expl.) 676; 
^-transformation, 686, (expl.) 676; 
conversion factors (area), 439. (force, 
energy, power) 443-4. (heat) 460, 
(length) 439, (mass) 440-1, (pressure) 
455i (temperature) 464, (velocity) 441, 
(\olumc) 440; month numbers, 437; 
reduced factorials, binomial coefficients, 
359; atomic weights, 464: relations 
between trigonometric and between 
hyperbolic functions, 136—7; standard 
derivatives, 191; standard integrals, 
230-3 

tables, use ot, 52-5, 81-3, 670, 673-5 
Talbot, 123 

tangent, see trigonometric functions, 
hyperbolic functions; gaKanonieter, 
201; inverse, 364; to a curve, 174 
tanh, see hyperbolic functions; pronunci¬ 
ation, 134 

taste tc.st, 228. 625-f) 

'I'aylor series, 353-5, 361-2, 364-5. 367, 
403-5. 483. 565: convergence of, 361, 
403; complex. 364, 403-5; correct¬ 
ness of, 361 2. 403; multiple, 367; 
uniqueness, 365; /or pariuiilur funi- 
lions, see under series 
temperature, of gas, 34, 208, 212-14, 
219-23: scales anti units, 34, (>2, 404: 
absolute, if»i, 464; and pulse rate, 

145-6 

lemi, in determinant, 498-9, 526-K; t)f 
polynomial, 30 

terminal m.iximum ami minimum. 317-18 
terminus or limit oI integration, 259, 300, 
306 -7 

test, fi>r lactors, 10, 59; taste. ^>25-6, ^.28; 

significance, see significance tests 
tctanolysin, 148 
tetrahedron, 377 
third-order detcrminaiU, 490 
'I'homp^on, d’.\rc\ \\., 95. 3--. .'44 
thread, tension in. 34 
throwback, 55 

time, 436 8; of geslatitin, 581-3. <.i S -19, 
653 - 5 ; it! Colson form. <>(n 
tortoise, 334 . . 350 , 361 

total, of observations, 571, 57f> 8, 585, 
586, f»IO II, f)25, (I 2 (>, 629, 631: JOA 
and column. 369. 5 V 7 , 600; graml, 3t>K, 
3O9, O29; total derivative, see deriv.i- 

tive 

train, 164-8 
iransfonner, 474-5 

transpose, ol determinant, 502, 515; <d 
matrix, 515-17. 5 * 9 . 

triangle, 69; urea of, 284, 293,302; cen¬ 
troid of, .15.4: mediafis ot, 376, 454; 
of vectors. 374-9; I*ascal, 358 t>i 
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triangular pyramid, 377 
trigonometric functions, 71-2, 77; addi¬ 
tion formulas, 79-80, 83, 256; relations 
between, 136-^; complex, 398-403; 
derivative of, 180-1, 184-7; evaluation 
of, 81-2; graphs of, 76; integral of, 
230-3,432; of 30® and 45°, 73, 78; of 
15° and 75°, 78 
trip, 663 

triple integral, 446-9. 453 - 5 , 539 
true (mean, variance, etc.), see distri¬ 
bution 

Tweedie, M. C. K., estimation method of, 

641-3 

two-valued function, 138 
two-variable distribution, 597-610 
typhoid, 626 

unending decimal, 13, 93 
unique solution of equations, 493-6, 506, 
526 

unit matrix, 521, 528 
unit vector, 378 

units, of angle, 70, 125; of area, 439; 
assay (international), 479—80; of bright¬ 
ness, 479; electric and magnetic, 469- 
75; of energ>', power, 443 - 4 , 459 - 63 : 
of power (of lens), 479; of force, 161, 
443-4; of heat, 214; of length, 438-9; 
of pressure, 161, 4551 of \’iscosity, 
457; of volume, 440; change of, 442, 
443, 460-3, 590-1, 602; British, 160, 
438-44, 592; metric, 441-2, 460, 464; 
derived, 442-3; independence of, 
460-3 

unknown, elimination of, 486-91, 493“6 
unweighted total, 600 
urine, flow of, 458—9 
uterus, shape of, 95 

value, calorific, 155; absolute, see abso¬ 
lute magnitude 
van der Monde, 365 

variable, dependent or independent, 201; 
change of, in integral, 234-8, 307-12, 
539; separable, in differential equation, 
247; differential equation solvable for, 
249-SO 

variance, defined, 573 , 575 : of sample, 
573-8, 585-7, 591-2. 599, 604-5, 608, 

619,630; of distribution, 575-6, 580-1, 
588-92, 606, 608, 613, 630; of sum, 
difference, weighted combination, 610- 
II, 613, 615; estimation of, 618-19, 
630; standard error of, 619; ratio, 
629-32, 676, 684-5; analysis of, 574, 
628-32. 676 

variation, constrained, 219-25 
vector, 371-8; addition, 374-^1 column 
and row, 522—3; component of, 371—2; 
magnitude or modulus of, 377; mean, 
612-13; plane. 378-9; representing 


complex number, 383 ; spatial, 371; 
subtraction 376; unit, 378; zero, 

371 ,374 ^ ^ 

velocity, instantaneous, 169-73, 226-8 
{see ^50 derivative); average, 164-9, 
172-3; of chemical reaction, 148-9, 
464-8; of light, 472, 478; of sound, 
192; integr^ of, 257-8; units and 
conversion factors, 441 
Verhulst, 246 
vertex, 91, 99-100. 102 
vibration, 408—14 
violin string, 410—14 
viscosity 456-8 
vision, 128 
vitamin, 480 

volume, summary of formulas, 294; as 
definite integral, 284, 446—7, 449, 
453-5, 539; of bacterium, 294; of 
cone, 286-7, 294; of cylinder, 285-6, 
291, 294; of gas, 55-6, 208, 212-14, 
219-23; of lungs, 295; of parallelo- 
pipedon, 287-8, 294; of prism, 285-6, 
294; of pyramid, 286^, 294; of 
sphere, 287, 294; of spheroid, 291-2, 
294; of thin slice, 284-5; of surface of 
revolution, 290, 294; units and con> 
version factors of, 440 

Waage, 466 
Walbum, 148-9 
Wales, 7, 34 
water, 57 

Watson, G. N., 367 
watt, 442, 444, 471 
w'avelength, 479 
weight, atomic, 463 

weight, and sitting height, 123; and sur¬ 
face area, 146; at birth, 653-5; of 
schoolgirls, 41, 42, 50, 119 
weighted estimate, 623; combination, 
610-11,613-15,623; mean, 452 
Weinberg, 552 
Whittaker, E. T., 367, 414 
work, as definite integral, 276-7, 300 
wound, healing of, 146 
Wright, S., 528 

AT-axis, 35, 69 

x-co-ordinate (abscissa), 69 

Yates, F., 304, 564, 569, 624, 648, 684 
>’-axis, 35, 69 

3’-co-ordinate (ordinate), 69 
yellow mice, 549; yellow peas, 2, 549—50, 
578, 620 

Yule, G, U., 597, 626, 682 

zero complex number, 381; correlation, 
603; .determinant, 504; matrix, 515; 
vector, 371, 374; di\nsion by, 5 
^-transformation, 620, 622, 676, 686 
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